argilla/distilabeled-Marcoro14-7B-slerp-full

plaguss (HF staff) committed on Jan 14

Commit d6d029a • 1 Parent(s): 66ff9d1

Files changed (1)

README.md +29 -0 (added)

	@@ -0,0 +1,29 @@

+---
+license: apache-2.0
+datasets:
+  - argilla/distilabel-intel-orca-dpo-pairs
+language:
+  - en
+tags:
+  - distilabel
+  - dpo
+  - rlaif
+  - rlhf
+  - merge
+  - mergekit
+---
+# ⚗️ distilabeled Marcoro14 7B Slerp
+<p align="center">
+  <a href="https://github.com/argilla-io/distilabel">
+    <img src="https://raw.githubusercontent.com/argilla-io/distilabel/main/docs/assets/distilabel-badge-light.png" alt="Built with Distilabel" width="200" height="32"/>
+  </a>
+</p>
+## Introduction
+This model is a new DPO fine-tune of the [mlabonne/Marcoro14-7B-slerp](https://huggingface.co/mlabonne/Marcoro14-7B-slerp) model on our new open dataset [argilla/distilabel-intel-orca-dpo-pairs](https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs). You can find more information about the "distilabeled" dataset in the [argilla/distilabeled-Hermes-2.5-Mistral-7B](https://huggingface.co/argilla/distilabeled-Hermes-2.5-Mistral-7B/blob/main/README.md#introduction) repo, and you can also visit [distilabel](https://github.com/argilla-io/distilabel).
+This version is the same model as [argilla/distilabeled-Marcoro14-7B-slerp](https://huggingface.co/argilla/distilabeled-Marcoro14-7B-slerp), but fine-tuned for a full epoch on our dataset.