Llama3-8B-SimPO
datasets:
  - HuggingFaceH4/ultrafeedback_binarized
base_model:
  - OpenRLHF/Llama-3-8b-sft-mixture

Base model: OpenRLHF/Llama-3-8b-sft-mixture

Preference dataset: HuggingFaceH4/ultrafeedback_binarized
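
The snippet below is a minimal sketch of how the checkpoint and the preference dataset named above might be loaded with `transformers` and `datasets`. Several things in it are assumptions rather than facts stated in this README: the repository id `zkshan2002/Llama3-8B-SimPO` is inferred from the page path and uploader name, the `train_prefs` split and the `prompt` / `chosen` / `rejected` column layout reflect the dataset as commonly published, and the helper names are purely illustrative.

```python
# Assumption: repository id inferred from the page path and uploader name.
MODEL_ID = "zkshan2002/Llama3-8B-SimPO"
# Taken from this README.
BASE_MODEL_ID = "OpenRLHF/Llama-3-8b-sft-mixture"
DATASET_ID = "HuggingFaceH4/ultrafeedback_binarized"


def load_model_and_tokenizer(model_id: str = MODEL_ID):
    """Load a causal LM and its tokenizer (downloads weights on first call)."""
    # Imported lazily so the rest of the module works without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
    return model, tokenizer


def load_preference_data(split: str = "train_prefs"):
    """Load one split of the preference dataset (assumed split name)."""
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=split)


def to_preference_pair(row: dict) -> dict:
    """Reduce a row to prompt text plus chosen and rejected responses.

    Assumes `chosen` and `rejected` are lists of chat messages whose
    last entry is the assistant reply.
    """
    return {
        "prompt": row["prompt"],
        "chosen": row["chosen"][-1]["content"],
        "rejected": row["rejected"][-1]["content"],
    }
```

Calling `load_model_and_tokenizer()` pulls the full 8B checkpoint, so it is worth checking `to_preference_pair` on a few rows from `load_preference_data()` first to confirm the column layout matches the assumption.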