sgimmel commited on
Commit
0a9bb86
1 Parent(s): e4fb7d7

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [EleutherAI/gpt-neo-125m](https://huggingface.co/EleutherAI/gpt-neo-125m) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 5.9785
19
 
20
  ## Model description
21
 
@@ -40,17 +40,19 @@ The following hyperparameters were used during training:
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - num_epochs: 5
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | No log | 1.0 | 109 | 6.0482 |
50
- | No log | 2.0 | 218 | 5.9956 |
51
- | No log | 3.0 | 327 | 5.9805 |
52
- | No log | 4.0 | 436 | 5.9763 |
53
- | 5.8761 | 5.0 | 545 | 5.9785 |
 
 
54
 
55
 
56
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [EleutherAI/gpt-neo-125m](https://huggingface.co/EleutherAI/gpt-neo-125m) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 8.4804
19
 
20
  ## Model description
21
 
 
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
+ - num_epochs: 7
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | No log | 1.0 | 109 | 8.0124 |
50
+ | No log | 2.0 | 218 | 8.1313 |
51
+ | No log | 3.0 | 327 | 8.2502 |
52
+ | No log | 4.0 | 436 | 8.3335 |
53
+ | 1.1381 | 5.0 | 545 | 8.4097 |
54
+ | 1.1381 | 6.0 | 654 | 8.4559 |
55
+ | 1.1381 | 7.0 | 763 | 8.4804 |
56
 
57
 
58
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d0181d2bd1edb33a4b58a68c8a8b2620d2fcc5535d7c1e6760f244afe2977039
3
  size 500811336
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7e39cc91cc8431e0ce3a02b4b6fab4ca28f24636739b0098b8359f2023cb4f3c
3
  size 500811336
runs/Nov21_22-56-47_76f7abf549a5/events.out.tfevents.1700607414.76f7abf549a5.47.4 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0f56be874b65f1faf1e7326eb4004294305832d4de4b3e2270139447e4bf213e
3
- size 5965
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b4170ea6994f85ea43de9df630547fcd85a76694492c910170d3bf96ce774268
3
+ size 7132
runs/Nov21_22-56-47_76f7abf549a5/events.out.tfevents.1700607567.76f7abf549a5.47.5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:18ac116ad8ec794bc2947fc6e09fd556fec13fe55ffaac95d4f0b4021d695485
3
+ size 359