dmcooller/neural-matia-ft-4

Files changed (5) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9546
 ## Model description
@@ -35,7 +35,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.002
 - train_batch_size: 5
 - eval_batch_size: 5
 - seed: 42
@@ -51,18 +51,18 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.1044        | 0.95  | 5    | 1.4243          |
-| 0.9363        | 1.9   | 10   | 0.8787          |
-| 0.6967        | 2.86  | 15   | 0.8134          |
-| 0.5521        | 4.0   | 21   | 0.7611          |
-| 0.5971        | 4.95  | 26   | 0.7678          |
-| 0.5431        | 5.9   | 31   | 0.7748          |
-| 0.4871        | 6.86  | 36   | 0.7967          |
-| 0.359         | 8.0   | 42   | 0.8310          |
-| 0.3786        | 8.95  | 47   | 0.8991          |
-| 0.3355        | 9.9   | 52   | 0.9386          |
-| 0.303         | 10.86 | 57   | 0.9548          |
-| 0.2029        | 11.43 | 60   | 0.9546          |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8002
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0002
 - train_batch_size: 5
 - eval_batch_size: 5
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 3.8166        | 0.95  | 5    | 3.3249          |
+| 2.9744        | 1.9   | 10   | 2.5352          |
+| 2.2051        | 2.86  | 15   | 1.9065          |
+| 1.288         | 4.0   | 21   | 1.2826          |
+| 1.0574        | 4.95  | 26   | 1.0073          |
+| 0.8263        | 5.9   | 31   | 0.8886          |
+| 0.7487        | 6.86  | 36   | 0.8408          |
+| 0.5904        | 8.0   | 42   | 0.8178          |
+| 0.6909        | 8.95  | 47   | 0.8095          |
+| 0.681         | 9.9   | 52   | 0.8039          |
+| 0.6745        | 10.86 | 57   | 0.8008          |
+| 0.4618        | 11.43 | 60   | 0.8002          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3a57ac12db77d7238b30503202e7dc2f58723461b0e8d323437d2c61ce18f948
 size 6300984

 version https://git-lfs.github.com/spec/v1
+oid sha256:2797f1998e7683a76a05fb207bc0782ec4ec7c6237fbfef8c87db22eaae82793
 size 6300984

runs/Apr04_06-57-50_06b299a88d3f/events.out.tfevents.1712213916.06b299a88d3f.34.3 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:da12b40b6b2bfbaa18c609949605323f8d1bc9c6564eb4ae6ab71d5efa59bfea
+size 5236

runs/Apr04_06-59-22_06b299a88d3f/events.out.tfevents.1712213966.06b299a88d3f.34.4 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:336af3000845479ad5a3181c542df9b2f0673424407aa1abf42d57d2ccba7d48
+size 11260

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7a088111b44056776e5ddb471f84a849fba45e348291e8418cc327759c917db8
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:f6bd2dd89f06aa97e70272158e4640129f45f3820f5a53689c2250752d2b4131
 size 4920