saumitras01/shawgpt-ft

Files changed (9) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.8899
 ## Model description
@@ -45,28 +45,27 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
 - num_epochs: 10
-- mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 4.5947        | 0.92  | 3    | 3.9682          |
-| 4.058         | 1.85  | 6    | 3.4521          |
-| 3.4874        | 2.77  | 9    | 2.9993          |
-| 2.2713        | 4.0   | 13   | 2.5722          |
-| 2.698         | 4.92  | 16   | 2.3416          |
-| 2.3939        | 5.85  | 19   | 2.1543          |
-| 2.187         | 6.77  | 22   | 2.0294          |
-| 1.5495        | 8.0   | 26   | 1.9548          |
-| 2.0129        | 8.92  | 29   | 1.9055          |
-| 1.3924        | 9.23  | 30   | 1.8899          |
 ### Framework versions
 - PEFT 0.9.0
-- Transformers 4.38.1
-- Pytorch 2.1.0+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.2

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.2329
 ## Model description
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
 - num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 4.6496        | 0.92  | 3    | 4.2329          |
+| 4.6271        | 1.85  | 6    | 4.2329          |
+| 4.6451        | 2.77  | 9    | 4.2329          |
+| 3.4785        | 4.0   | 13   | 4.2329          |
+| 4.654         | 4.92  | 16   | 4.2329          |
+| 4.6348        | 5.85  | 19   | 4.2329          |
+| 4.6483        | 6.77  | 22   | 4.2329          |
+| 3.4571        | 8.0   | 26   | 4.2329          |
+| 4.6148        | 8.92  | 29   | 4.2329          |
+| 3.0699        | 9.23  | 30   | 4.2329          |
 ### Framework versions
 - PEFT 0.9.0
+- Transformers 4.39.0.dev0
+- Pytorch 2.2.1+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.2

runs/Mar06_12-53-42_instance-20240306-123153/events.out.tfevents.1709729622.instance-20240306-123153 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:9598cac9dfc351c185c56fcb0a4ea497e0463c7b2495d8da27dffda0a1e83726
+size 5172

runs/Mar06_13-33-49_instance-20240306-123153/events.out.tfevents.1709732029.instance-20240306-123153 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:fe1fcf646b81316eaba879a3304e423e95e65a45c258eb078dd9f6b914b8be10
+size 5177

runs/Mar06_13-35-48_instance-20240306-123153/events.out.tfevents.1709732148.instance-20240306-123153 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:48d836969038fef65c66fdf3e5443f0c2089486f2ce02a2bc98c5f06d190aeb0
+size 5177

runs/Mar06_13-41-15_instance-20240306-123153/events.out.tfevents.1709732475.instance-20240306-123153 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:abfbaa90710536115f96b4d195b7626951b8ebdf50265a8cb0a97fdae9e7c78e
+size 5177

runs/Mar06_14-20-08_instance-20240306-123153/events.out.tfevents.1709734808.instance-20240306-123153 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0bfa05e7fe9194a01f06649eed108554787e5483a19189262a7089b5d919d26b
+size 5204

runs/Mar06_14-21-10_instance-20240306-123153/events.out.tfevents.1709734870.instance-20240306-123153 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:c24f64228558951f3d7945e7bafdce67ab01ac8c066b9dc78375db2e46d79369
+size 10283

runs/Mar06_14-24-03_instance-20240306-123153/events.out.tfevents.1709735044.instance-20240306-123153 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:ecb5b990a8b18562dfb859e2a18bd4053d38e9cee22bdafcdef3e2fe325ef7fa
+size 10283

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:05da5d2163f31e7dca961d3c9381b6855f332ba90b26fb5c3225552889bfaa79
 size 4856

 version https://git-lfs.github.com/spec/v1
+oid sha256:386d01f80bbabdbf45af886bf53a46615012ac954d5eb495ed5e8531725cb2d4
 size 4856