Edit model card

Model Card for Model ID

This bot gives a bitter review fn any paper you submit. See https://hippocampus-garden.com/tiny_llama_dpo_lora/ for full details.

Model Details

Model Description

  • Developed by: Shion Honda
  • Model type: Text Generation
  • Language(s) (NLP): English
  • License: MIT
  • Finetuned from model: TinyLlama/TinyLlama-1.1B-Chat-v1.0

Model Card Contact

[More Information Needed]

Framework versions

  • PEFT 0.10.0
Downloads last month
0
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for shionhonda/tiny-llama-reviewer2-1.1B-dpo-lora

Adapter
(491)
this model

Dataset used to train shionhonda/tiny-llama-reviewer2-1.1B-dpo-lora