Model save

Files changed (6)

README.md CHANGED

@@ -2,15 +2,9 @@
 license: other
 base_model: facebook/opt-2.7b
 tags:
-- alignment-handbook
 - trl
 - dpo
 - generated_from_trainer
-- trl
-- dpo
-- generated_from_trainer
-datasets:
-- RedMist137/AIHF_DPO_iter0
 model-index:
 - name: DPO-Zephyr-7B
   results: []
@@ -21,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 # DPO-Zephyr-7B
-This model is a fine-tuned version of [facebook/opt-2.7b](https://huggingface.co/facebook/opt-2.7b) on the RedMist137/AIHF_DPO_iter0 dataset.
 ## Model description

 license: other
 base_model: facebook/opt-2.7b
 tags:
 - trl
 - dpo
 - generated_from_trainer
 model-index:
 - name: DPO-Zephyr-7B
   results: []
 # DPO-Zephyr-7B
+This model is a fine-tuned version of [facebook/opt-2.7b](https://huggingface.co/facebook/opt-2.7b) on the None dataset.
 ## Model description

all_results.json CHANGED

@@ -1,9 +1,9 @@
 {
-    "epoch": 0.5192557334487234,
     "total_flos": 0.0,
-    "train_loss": 0.0,
-    "train_runtime": 1.1312,
     "train_samples": 40000,
-    "train_samples_per_second": 35360.967,
-    "train_steps_per_second": 1105.03
 }

 {
+    "epoch": 1.0,
     "total_flos": 0.0,
+    "train_loss": 0.69168408203125,
+    "train_runtime": 25620.0654,
     "train_samples": 40000,
+    "train_samples_per_second": 1.561,
+    "train_steps_per_second": 0.049
 }
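The new throughput figures can be cross-checked against the reported runtime. The sketch below is a sanity check, not part of the commit. It assumes that `train_samples_per_second` is simply `train_samples / train_runtime` and that multiplying `train_steps_per_second` by `train_runtime` approximates the number of optimizer steps. The helper names are hypothetical.

```python
# Values copied from the new side of all_results.json in this diff.
new_metrics = {
    "train_runtime": 25620.0654,
    "train_samples": 40000,
    "train_samples_per_second": 1.561,
    "train_steps_per_second": 0.049,
}


def samples_per_second(metrics):
    """Recompute throughput, assuming it is samples divided by runtime."""
    return metrics["train_samples"] / metrics["train_runtime"]


def implied_steps(metrics):
    """Estimate the step count from the (rounded) steps-per-second rate."""
    return metrics["train_steps_per_second"] * metrics["train_runtime"]


recomputed = samples_per_second(new_metrics)
steps = implied_steps(new_metrics)

print(round(recomputed, 3))  # 1.561, matching the reported value
print(round(steps))          # 1255; approximate, since 0.049 is rounded
```

Because the step rate is reported to only three decimals, the implied step count is an estimate rather than an exact figure.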

model-00001-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dd418fdddd3a80cfefe85a8306f00c640965e6dff3cc61d47601d265c4706f42
 size 4988545248

 version https://git-lfs.github.com/spec/v1
+oid sha256:926d6a1076c6673f80db2020356316e274a18393f7a0675e59c9f9efef0747d6
 size 4988545248

model-00002-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:76f398442390d7b851d5567b3e136eb1aaccc772060868b996180ddb996b89d1
 size 314709704

 version https://git-lfs.github.com/spec/v1
+oid sha256:5c4108a4b572d5ea6d9fa334a21e1660fa9d57bf5cb2d8688c19cc15f35a8f9b
 size 314709704
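Each `.safetensors` entry above is a Git LFS pointer, not the weights themselves: `oid` is the SHA-256 of the file contents and `size` is its length in bytes. Only the `oid` changed in this commit, so the shards were rewritten with the same byte size. A minimal sketch of checking a downloaded shard against its pointer follows. The function names are hypothetical, and the pointer text is copied from the new side of the diff.

```python
import hashlib
import io


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def matches_pointer(stream, pointer, chunk_size=1 << 20):
    """Hash a binary stream in chunks and compare digest and size to the pointer."""
    hasher = hashlib.sha256()
    total = 0
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        hasher.update(chunk)
        total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer for model-00002-of-00002.safetensors after this commit.
pointer_text = """
version https://git-lfs.github.com/spec/v1
oid sha256:5c4108a4b572d5ea6d9fa334a21e1660fa9d57bf5cb2d8688c19cc15f35a8f9b
size 314709704
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["algo"], pointer["size"])
```

To verify a real shard, open it in binary mode and pass the file object to `matches_pointer`. Reading in chunks avoids loading a multi-gigabyte file into memory.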

train_results.json CHANGED

@@ -1,9 +1,9 @@
 {
-    "epoch": 0.5192557334487234,
     "total_flos": 0.0,
-    "train_loss": 0.0,
-    "train_runtime": 1.1312,
     "train_samples": 40000,
-    "train_samples_per_second": 35360.967,
-    "train_steps_per_second": 1105.03
 }

 {
+    "epoch": 1.0,
     "total_flos": 0.0,
+    "train_loss": 0.69168408203125,
+    "train_runtime": 25620.0654,
     "train_samples": 40000,
+    "train_samples_per_second": 1.561,
+    "train_steps_per_second": 0.049
 }

trainer_state.json CHANGED

The diff for this file is too large to render. See raw diff