Edit model card

Uploaded model

  • Developed by: RLHF-And-Friends
  • License: apache-2.0
  • Finetuned from model : unsloth/Meta-Llama-3.1-8B-bnb-4bit

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
47
Safetensors
Model size
8.03B params
Tensor type
BF16
·
Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for RLHF-And-Friends/Llama3.1-8B-DPO-0.05

Quantized
(201)
this model