SleepyGorilla/Mistral_7B

Browse files

Files changed (5) hide show

README.md +13 -13
adapter_config.json +2 -2
adapter_model.safetensors +1 -1
runs/Mar18_11-18-31_7b8536c7ca90/events.out.tfevents.1710760802.7b8536c7ca90.253.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -18,15 +18,15 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/OpenHermes-2-Mistral-7B-GPTQ](https://huggingface.co/TheBloke/OpenHermes-2-Mistral-7B-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0132
-- Rewards/chosen: -1.4792
-- Rewards/rejected: -8.5855
 - Rewards/accuracies: 1.0
-- Rewards/margins: 7.1064
-- Logps/rejected: -319.5252
-- Logps/chosen: -138.4254
-- Logits/rejected: -2.3872
-- Logits/chosen: -2.5369
 ## Model description
@@ -59,11 +59,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
 |:-------------:|:-----:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
-| 0.5575        | 0.0   | 10   | 0.4017          | 0.0150         | -0.6143          | 1.0                | 0.6293          | -239.8125      | -123.4837    | -2.4102         | -2.6084       |
-| 0.3781        | 0.0   | 20   | 0.1298          | -0.2390        | -2.2414          | 1.0                | 2.0025          | -256.0842      | -126.0231    | -2.3786         | -2.6120       |
-| 0.219         | 0.0   | 30   | 0.0410          | -0.5640        | -4.3638          | 1.0                | 3.7998          | -277.3080      | -129.2739    | -2.3879         | -2.5872       |
-| 0.038         | 0.0   | 40   | 0.0168          | -1.2083        | -7.3369          | 1.0                | 6.1286          | -307.0389      | -135.7168    | -2.3962         | -2.5566       |
-| 0.0669        | 0.0   | 50   | 0.0132          | -1.4792        | -8.5855          | 1.0                | 7.1064          | -319.5252      | -138.4254    | -2.3872         | -2.5369       |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/OpenHermes-2-Mistral-7B-GPTQ](https://huggingface.co/TheBloke/OpenHermes-2-Mistral-7B-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0323
+- Rewards/chosen: -1.0831
+- Rewards/rejected: -9.5400
 - Rewards/accuracies: 1.0
+- Rewards/margins: 8.4569
+- Logps/rejected: -319.0484
+- Logps/chosen: -82.1537
+- Logits/rejected: -2.6023
+- Logits/chosen: -2.2419
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
 |:-------------:|:-----:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
+| 0.556         | 0.0   | 10   | 0.4270          | 0.0181         | -0.6933          | 1.0                | 0.7114          | -230.5818      | -71.1419     | -2.6557         | -2.3264       |
+| 0.3645        | 0.0   | 20   | 0.1583          | 0.1759         | -2.5866          | 1.0                | 2.7625          | -249.5149      | -69.5636     | -2.6400         | -2.3441       |
+| 0.2037        | 0.0   | 30   | 0.0681          | 0.0985         | -5.0474          | 1.0                | 5.1459          | -274.1230      | -70.3379     | -2.6245         | -2.3228       |
+| 0.0315        | 0.0   | 40   | 0.0431          | -0.9146        | -8.5841          | 1.0                | 7.6695          | -309.4896      | -80.4684     | -2.6094         | -2.3791       |
+| 0.0655        | 0.0   | 50   | 0.0323          | -1.0831        | -9.5400          | 1.0                | 8.4569          | -319.0484      | -82.1537     | -2.6023         | -2.2419       |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -19,8 +19,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:58a9849f918c2040073e354ef8b6860deae289185db34920861266a53e2e876e
 size 13648432

 version https://git-lfs.github.com/spec/v1
+oid sha256:a1466527e1d7ec9ea623a2b1a3644994d886ad1aad67a8fc85e3b410a38dbbb2
 size 13648432

runs/Mar18_11-18-31_7b8536c7ca90/events.out.tfevents.1710760802.7b8536c7ca90.253.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:17c72d31d6a07e6665e825cf2ad733b338d99537a96eb15d81aafb05a4bd4941
+size 13334

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4087bb6f460ec3cc9afe55fb6b825229b72adfd18a1c406f7ebba757842a8f84
 size 4475

 version https://git-lfs.github.com/spec/v1
+oid sha256:1fb2c5d74328222a654b9d9ed2c97b55d65127718bab2605d82a4b5533637423
 size 4475