Safetensors
llama
Llama3-8B-SimPO / README.md
zkshan2002's picture
Create README.md
a9af4f2 verified
---
datasets:
- HuggingFaceH4/ultrafeedback_binarized
base_model:
- OpenRLHF/Llama-3-8b-sft-mixture
---
Base model: [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co/OpenRLHF/Llama-3-8b-sft-mixture)
Preference dataset: [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized)