HCHSmost commited on
Commit
8276728
1 Parent(s): 33f8479

HCHSmost/TerryGPT-refined_version

Browse files
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.6100
20
 
21
  ## Model description
22
 
@@ -51,15 +51,15 @@ The following hyperparameters were used during training:
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:------:|:----:|:---------------:|
54
- | 2.5684 | 0.8696 | 5 | 1.9494 |
55
- | 1.578 | 1.9130 | 11 | 1.4150 |
56
- | 1.1379 | 2.9565 | 17 | 1.0191 |
57
- | 0.8089 | 4.0 | 23 | 0.7513 |
58
- | 0.774 | 4.8696 | 28 | 0.6820 |
59
- | 0.5856 | 5.9130 | 34 | 0.6451 |
60
- | 0.5621 | 6.9565 | 40 | 0.6235 |
61
- | 0.5319 | 8.0 | 46 | 0.6122 |
62
- | 0.5447 | 8.6957 | 50 | 0.6100 |
63
 
64
 
65
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.7287
20
 
21
  ## Model description
22
 
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:------:|:----:|:---------------:|
54
+ | 2.5921 | 0.8696 | 5 | 2.0205 |
55
+ | 1.5501 | 1.9130 | 11 | 1.4288 |
56
+ | 1.1411 | 2.9565 | 17 | 1.0682 |
57
+ | 0.8805 | 4.0 | 23 | 0.8740 |
58
+ | 0.8972 | 4.8696 | 28 | 0.8145 |
59
+ | 0.7032 | 5.9130 | 34 | 0.7677 |
60
+ | 0.6435 | 6.9565 | 40 | 0.7422 |
61
+ | 0.6246 | 8.0 | 46 | 0.7309 |
62
+ | 0.6399 | 8.6957 | 50 | 0.7287 |
63
 
64
 
65
  ### Framework versions
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:66422ce2903ebf37cfe1ee3cdc201d1268cb34c9894771d555d691f65d404898
3
  size 8397056
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3c2fd7fa62a79629b5063f8e525fe3efbaeb411c365a3c24ee862b1b4b9e8e7d
3
  size 8397056
runs/Jul21_06-48-36_0b61a5c95088/events.out.tfevents.1721544531.0b61a5c95088.7106.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1e8be3a0ecd6801f2bc942d12243e41dcdfdf34ac3158b70a1c122e88e89181b
3
+ size 10079
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:34319982afab6b0603c3cf3bc3e72971fff53b515aaac58d8da9a3d60c424a1e
3
  size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:40ec43e4786496af9d25e8b86283ac01dab59146d86648dbbff61a7db5c677c3
3
  size 5112