HCHSmost committed
Commit 33f8479
1 Parent(s): c722a7c

HCHSmost/TerryGPT-ft

README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.5955
+ - Loss: 0.6100
 
  ## Model description
 
@@ -51,15 +51,15 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:------:|:----:|:---------------:|
- | 2.0675 | 0.8696 | 5 | 1.4865 |
- | 1.1877 | 1.9130 | 11 | 1.0028 |
- | 0.8015 | 2.9565 | 17 | 0.7186 |
- | 0.6361 | 4.0 | 23 | 0.6500 |
- | 0.6824 | 4.8696 | 28 | 0.6235 |
- | 0.5246 | 5.9130 | 34 | 0.6054 |
- | 0.5007 | 6.9565 | 40 | 0.5981 |
- | 0.4786 | 8.0 | 46 | 0.5960 |
- | 0.4843 | 8.6957 | 50 | 0.5955 |
+ | 2.5684 | 0.8696 | 5 | 1.9494 |
+ | 1.578 | 1.9130 | 11 | 1.4150 |
+ | 1.1379 | 2.9565 | 17 | 1.0191 |
+ | 0.8089 | 4.0 | 23 | 0.7513 |
+ | 0.774 | 4.8696 | 28 | 0.6820 |
+ | 0.5856 | 5.9130 | 34 | 0.6451 |
+ | 0.5621 | 6.9565 | 40 | 0.6235 |
+ | 0.5319 | 8.0 | 46 | 0.6122 |
+ | 0.5447 | 8.6957 | 50 | 0.6100 |
 
 
  ### Framework versions
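To see how the two training runs in the README diff compare, the validation losses can be lined up step by step. The following is a minimal sketch, not part of the commit: the lists are copied by hand from the removed and added table rows, and `loss_deltas` is a hypothetical helper name chosen for illustration.

```python
# Validation loss at each logged step, copied from the README table
# before (removed rows) and after (added rows) this commit.
steps = [5, 11, 17, 23, 28, 34, 40, 46, 50]
old_val = [1.4865, 1.0028, 0.7186, 0.6500, 0.6235, 0.6054, 0.5981, 0.5960, 0.5955]
new_val = [1.9494, 1.4150, 1.0191, 0.7513, 0.6820, 0.6451, 0.6235, 0.6122, 0.6100]


def loss_deltas(old, new):
    """Return new minus old for each pair, rounded to 4 decimal places."""
    return [round(n - o, 4) for o, n in zip(old, new)]


deltas = loss_deltas(old_val, new_val)
for step, delta in zip(steps, deltas):
    # A positive delta means the new run has a higher validation loss at that step.
    print(f"step {step:>2}: {delta:+.4f}")
```

Every delta is positive, so the new run sits above the old one at each logged step, with the gap narrowing from roughly 0.46 at step 5 to roughly 0.015 at step 50.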
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:28817ba7056bb033a49665266fa00f5c570e4223c959349468bb2a63457be434
+ oid sha256:66422ce2903ebf37cfe1ee3cdc201d1268cb34c9894771d555d691f65d404898
  size 8397056
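The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the content hash, and the size in bytes. As a rough illustration of how to read such a diff, the sketch below parses the old and new pointers for `adapter_model.safetensors`. `parse_lfs_pointer` is a hypothetical helper written for this example, not a Git LFS or Hugging Face API.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a small dict (illustrative helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid is written as "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:28817ba7056bb033a49665266fa00f5c570e4223c959349468bb2a63457be434
size 8397056
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:66422ce2903ebf37cfe1ee3cdc201d1268cb34c9894771d555d691f65d404898
size 8397056
""")

# Same byte size, different digest: the file contents changed, its length did not.
print(old["size"] == new["size"], old["digest"] != new["digest"])
```

An unchanged size with a new digest is what one would expect when adapter weights of the same shape are retrained and overwritten, although the pointer alone does not confirm that.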
runs/Jul19_09-34-15_60bc7329ebf4/events.out.tfevents.1721381656.60bc7329ebf4.257.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:bfe0292ac227754d2461f897c408aa47030b94817368b801e77d498472818af7
+ size 10079
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:e976f53f6489e1d124f038ecc367949f55628b4267ce4a64282a45d2c88f94e6
+ oid sha256:34319982afab6b0603c3cf3bc3e72971fff53b515aaac58d8da9a3d60c424a1e
  size 5112