# 🦜 EmertonMonarch-7B
EmertonMonarch-7B is a DPO fine-tune of [mlabonne/Monarch-7B](https://huggingface.co/mlabonne/OmniBeagle-7B) using the [yleo/emerton_dpo_pairs_judge](https://huggingface.co/datasets/yleo/emerton_dpo_pairs_judge) preference dataset, created from [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs) by replacing the GPT-3.5 answers with GPT-4 Turbo answers. LLM-Blender is then used to judge between GPT-4 and GPT-4 Turbo.
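
The dataset construction described above can be sketched as follows. This is a minimal illustration, not the actual build script; the field names follow the common DPO pair convention (`prompt` / `chosen` / `rejected`), and the real column names in yleo/emerton_dpo_pairs_judge may differ.

```python
# Sketch of the preference-pair construction described above:
# start from an Intel/orca_dpo_pairs-style row and replace the
# original GPT-3.5 "chosen" answer with a GPT-4 Turbo answer,
# keeping the existing dispreferred answer as "rejected".

def make_dpo_pair(row: dict, gpt4_turbo_answer: str) -> dict:
    """Turn one orca_dpo_pairs-style row into a DPO preference pair,
    preferring the GPT-4 Turbo answer over the original rejected one."""
    return {
        "prompt": row["question"],
        "chosen": gpt4_turbo_answer,   # replaces the GPT-3.5 answer
        "rejected": row["rejected"],   # kept as the dispreferred answer
    }

# Toy example row (illustrative only, not from the real dataset):
row = {"question": "What is 2 + 2?", "chosen": "4", "rejected": "5"}
pair = make_dpo_pair(row, gpt4_turbo_answer="2 + 2 equals 4.")
print(pair)
```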
## 🔍 Applications
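
A minimal inference sketch using 🤗 Transformers. The repo id `yleo/EmertonMonarch-7B` is an assumption (it is not stated above), and the model is assumed to support a standard chat template; adjust both for the actual published checkpoint.

```python
# Hedged usage sketch. Repo id "yleo/EmertonMonarch-7B" is assumed,
# not confirmed by the model card text. Requires `pip install transformers`.

def build_messages(question: str) -> list:
    """Wrap a user question in the role/content chat format."""
    return [{"role": "user", "content": question}]

def generate(question: str, model_id: str = "yleo/EmertonMonarch-7B") -> str:
    """Answer a single question (heavy: downloads the model weights)."""
    from transformers import AutoTokenizer, pipeline  # lazy import
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    prompt = tokenizer.apply_chat_template(
        build_messages(question), tokenize=False, add_generation_prompt=True
    )
    pipe = pipeline("text-generation", model=model_id)
    return pipe(prompt, max_new_tokens=256)[0]["generated_text"]

# Example call (not executed here): generate("Why is the sky blue?")
```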