nblinh63/d7dec48d-6963-4941-a666-82afe3a01311

Generated from Trainer

8-bit precision


nblinh63 committed on Dec 1, 2024

Commit f0e5b92 · verified · 1 Parent(s): d178288

End of training

Files changed (2)

README.md +4 -4
adapter_model.bin +1 -1

README.md CHANGED

@@ -103,7 +103,7 @@ xformers_attention: null
 This model is a fine-tuned version of [Qwen/Qwen1.5-0.5B-Chat](https://huggingface.co/Qwen/Qwen1.5-0.5B-Chat) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.8077
+- Loss: 2.8069
 ## Model description
@@ -138,9 +138,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 5.4024        | 0.0001 | 1    | 4.9536          |
-| 3.6072        | 0.0004 | 3    | 4.8569          |
-| 2.7651        | 0.0009 | 6    | 3.9604          |
-| 3.1305        | 0.0013 | 9    | 2.8077          |
+| 3.6741        | 0.0004 | 3    | 4.8511          |
+| 2.7762        | 0.0009 | 6    | 3.9513          |
+| 3.1614        | 0.0013 | 9    | 2.8069          |
 ### Framework versions
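
To make the size of the change in the README.md table easier to see, the validation losses on both sides of the diff can be compared step by step. The following is a minimal sketch: the numbers are copied from the table rows above, while the names `old_val_loss`, `new_val_loss`, and `loss_deltas` are hypothetical and not part of the repository.

```python
# Validation loss per training step, copied from the removed (-) and
# added (+) rows of the README.md diff. Step 1 is unchanged context.
old_val_loss = {1: 4.9536, 3: 4.8569, 6: 3.9604, 9: 2.8077}
new_val_loss = {1: 4.9536, 3: 4.8511, 6: 3.9513, 9: 2.8069}


def loss_deltas(before: dict, after: dict) -> dict:
    """Return (after - before) per step, rounded to 4 decimal places."""
    return {step: round(after[step] - before[step], 4) for step in sorted(before)}


deltas = loss_deltas(old_val_loss, new_val_loss)

for step, delta in deltas.items():
    print(f"step {step}: {delta:+.4f}")
```

A negative delta means the validation loss reported in this commit is lower than in the parent commit at that step.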

adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d4f27e58fd697ef5233441900483b448fb0f28a4e1cd4044b4373c3db417cab1
+oid sha256:556025e7f44d14325dba8de7a232666819e6980687df534e570b32150d107f40
 size 30398410
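
The adapter_model.bin entry is a Git LFS pointer, so the diff shows only a new SHA-256 object ID while the size stays at 30398410 bytes. As a rough sketch of how a downloaded file could be checked against such a pointer, the code below parses the three pointer fields and compares size and digest. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is the `+` side of the diff above.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if data has the size and SHA-256 digest named by the pointer."""
    if pointer["algo"] != "sha256":
        return False
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer content as it appears on the new side of the diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:556025e7f44d14325dba8de7a232666819e6980687df534e570b32150d107f40
size 30398410
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

In practice one would read the downloaded adapter_model.bin as bytes and pass it to `matches_pointer(data, pointer)`. For a file of this size, reading it into memory at once is acceptable; much larger files would be better hashed in chunks.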