Christine789/qlora_v2

Files changed (5)

README.md CHANGED

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.7197
+- Loss: 1.7486
 ## Model description
@@ -51,22 +51,22 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 4.5915        | 0.9231 | 3    | 3.9610          |
-| 4.0346        | 1.8462 | 6    | 3.4295          |
-| 3.4477        | 2.7692 | 9    | 2.9668          |
-| 2.2397        | 4.0    | 13   | 2.5420          |
-| 2.6514        | 4.9231 | 16   | 2.2949          |
-| 2.3244        | 5.8462 | 19   | 2.0976          |
-| 2.0731        | 6.7692 | 22   | 1.9205          |
-| 1.425         | 8.0    | 26   | 1.7735          |
-| 1.7904        | 8.9231 | 29   | 1.7251          |
-| 1.2468        | 9.2308 | 30   | 1.7197          |
+| 4.5903        | 0.9231 | 3    | 3.9569          |
+| 4.0337        | 1.8462 | 6    | 3.4324          |
+| 3.4578        | 2.7692 | 9    | 2.9754          |
+| 2.2395        | 4.0    | 13   | 2.5348          |
+| 2.6193        | 4.9231 | 16   | 2.2674          |
+| 2.2876        | 5.8462 | 19   | 2.0560          |
+| 2.039         | 6.7692 | 22   | 1.9094          |
+| 1.4379        | 8.0    | 26   | 1.7941          |
+| 1.829         | 8.9231 | 29   | 1.7533          |
+| 1.2737        | 9.2308 | 30   | 1.7486          |
 ### Framework versions
-- PEFT 0.12.0
+- PEFT 0.13.0
 - Transformers 4.44.2
 - Pytorch 2.4.1+cu121
-- Datasets 3.0.0
+- Datasets 3.0.1
 - Tokenizers 0.19.1
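
To make the README change easier to read, the per-step validation-loss difference between the two runs can be computed directly from the table rows in the diff. This is only an illustrative sketch: the dictionary names and the `loss_deltas` helper are hypothetical, and the values are copied from the removed (`-`) and added (`+`) rows of the diff.

```python
# Validation loss keyed by training step, copied from the diff.
# old_val: removed (-) rows; new_val: added (+) rows.
old_val = {3: 3.9610, 6: 3.4295, 9: 2.9668, 13: 2.5420, 16: 2.2949,
           19: 2.0976, 22: 1.9205, 26: 1.7735, 29: 1.7251, 30: 1.7197}
new_val = {3: 3.9569, 6: 3.4324, 9: 2.9754, 13: 2.5348, 16: 2.2674,
           19: 2.0560, 22: 1.9094, 26: 1.7941, 29: 1.7533, 30: 1.7486}


def loss_deltas(old, new):
    """Return new - old for every step present in both runs (hypothetical helper)."""
    return {step: round(new[step] - old[step], 4)
            for step in sorted(old) if step in new}


deltas = loss_deltas(old_val, new_val)
for step, delta in deltas.items():
    # A positive delta means the new run has a higher validation loss at that step.
    print(f"step {step:>2}: {delta:+.4f}")
```

At the final step (30) the difference is positive, which matches the reported evaluation loss moving from 1.7197 to 1.7486.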

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6b9a954343e817f93a3388bb20f8e2c07492f4b41090a80a1d1de582e5d82a33
+oid sha256:9eaa4b2b4296c49c2bf972e688de5c784bffe0a86a7869809aa4e02aa6908679
 size 8397056

runs/Sep29_14-21-14_4cd08f998954/events.out.tfevents.1727619681.4cd08f998954.2777.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:b57e3311f35474bcc17029f8a4a888cdae79d7008cb4313f97d5d6b3d114de58
+size 10647

runs/Sep29_14-49-56_4cd08f998954/events.out.tfevents.1727621415.4cd08f998954.20051.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:c33a65a27eb05f63d02f4bb5435c51c45a9a68307515ab294f444cd4f83843bc
+size 10647

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7e0eb213f810b1f25158d286546697f6187542ce066cad400e7e88e44c24e7a2
+oid sha256:597b3d4e3273951727c629056c3d9fa9d7fa24efecd77e6521baa49f81189860
 size 5176
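
The binary files in this commit appear only as Git LFS pointers (`version`, `oid sha256:…`, `size`). As a rough sketch, a downloaded file can be checked against such a pointer by comparing its byte length and SHA-256 digest. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of Git LFS or any library; the pointer text is the new `training_args.bin` pointer from the diff.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {"version": fields["version"], "algo": algo,
            "digest": digest, "size": int(fields["size"])}


def matches_pointer(path, pointer):
    """Return True if the file's size and hash agree with the pointer."""
    # Reads the whole file into memory; fine for small files, but a
    # chunked read would be preferable for multi-gigabyte weights.
    data = Path(path).read_bytes()
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# New pointer for training_args.bin, copied from the diff.
pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:597b3d4e3273951727c629056c3d9fa9d7fa24efecd77e6521baa49f81189860
size 5176
""")
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("training_args.bin", pointer)` on a local copy (the path is an assumption) would then indicate whether that copy corresponds to the new revision rather than the old one.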