plaguss HF staff committed on
Commit
d6d029a
1 Parent(s): 66ff9d1
Files changed (1)
  1. README.md +29 -0
README.md ADDED
@@ -0,0 +1,29 @@
+ ---
+ license: apache-2.0
+ datasets:
+ - argilla/distilabel-intel-orca-dpo-pairs
+ language:
+ - en
+ tags:
+ - distilabel
+ - dpo
+ - rlaif
+ - rlhf
+ - merge
+ - mergekit
+ ---
+ # ⚗️ distilabeled Marcoro14 7B Slerp
+
+
+ <p align="center">
+ <a href="https://github.com/argilla-io/distilabel">
+ <img src="https://raw.githubusercontent.com/argilla-io/distilabel/main/docs/assets/distilabel-badge-light.png" alt="Built with Distilabel" width="200" height="32"/>
+ </a>
+ </p>
+
+
+ ## Introduction
+
+ This model is a DPO fine-tune of [mlabonne/Marcoro14-7B-slerp](https://huggingface.co/mlabonne/Marcoro14-7B-slerp) on our new open dataset [argilla/distilabel-intel-orca-dpo-pairs](https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs). You can find more information about the "distilabeled" dataset in the [argilla/distilabeled-Hermes-2.5-Mistral-7B](https://huggingface.co/argilla/distilabeled-Hermes-2.5-Mistral-7B/blob/main/README.md#introduction) repo, and about the tool used to build it at [distilabel](https://github.com/argilla-io/distilabel).
+
+ This version is the same as [argilla/distilabeled-Marcoro14-7B-slerp](https://huggingface.co/argilla/distilabeled-Marcoro14-7B-slerp), but fine-tuned for a full epoch on our dataset.