sgimmel committed on
Commit a609eb7
1 Parent(s): 03e1ec1

End of training

README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [EleutherAI/gpt-neo-125m](https://huggingface.co/EleutherAI/gpt-neo-125m) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 4.5221
+ - Loss: 5.9785

  ## Model description

@@ -40,22 +40,17 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 10
+ - num_epochs: 5

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | No log | 1.0 | 192 | 4.6578 |
- | No log | 2.0 | 384 | 4.5784 |
- | 4.5695 | 3.0 | 576 | 4.5419 |
- | 4.5695 | 4.0 | 768 | 4.5219 |
- | 4.5695 | 5.0 | 960 | 4.5098 |
- | 4.1799 | 6.0 | 1152 | 4.5060 |
- | 4.1799 | 7.0 | 1344 | 4.5073 |
- | 3.9822 | 8.0 | 1536 | 4.5148 |
- | 3.9822 | 9.0 | 1728 | 4.5200 |
- | 3.9822 | 10.0 | 1920 | 4.5221 |
+ | No log | 1.0 | 109 | 6.0482 |
+ | No log | 2.0 | 218 | 5.9956 |
+ | No log | 3.0 | 327 | 5.9805 |
+ | No log | 4.0 | 436 | 5.9763 |
+ | 5.8761 | 5.0 | 545 | 5.9785 |


  ### Framework versions
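
Note on reading the loss change in the README.md diff: the evaluation loss moves from 4.5221 to 5.9785. A minimal sketch of converting those values to perplexity follows, assuming the reported loss is a mean per-token cross-entropy in nats (typical for causal language model fine-tuning, but not stated in this diff). The function name `perplexity` and the variable names are illustrative and not part of the commit.

```python
import math

# Final evaluation losses copied from the README.md diff (old run vs. new run).
old_eval_loss = 4.5221
new_eval_loss = 5.9785


def perplexity(mean_nll_nats: float) -> float:
    """Convert a mean per-token negative log-likelihood (in nats) to perplexity.

    Assumption: the loss is a natural-log cross-entropy averaged over tokens.
    """
    return math.exp(mean_nll_nats)


for label, loss in (("old", old_eval_loss), ("new", new_eval_loss)):
    print(f"{label}: loss={loss:.4f} perplexity={perplexity(loss):.1f}")
```

Under that assumption, a difference of about 1.46 nats corresponds to a roughly fourfold higher perplexity for the new run. The steps per epoch also differ (192 before, 109 now), so the two runs may not share the same data or batch configuration, and the numbers may not be directly comparable.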
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:4ef3e4c03c6147ed4646c677a824abb1d6c21a6768b60df54bd8d4ec7ef2791d
+ oid sha256:9325ad5f015f6cb3e227edb4621dcf0aca0997072ff6ee3633fe5c85c4da5d5b
  size 500811336
runs/Nov21_22-45-59_76f7abf549a5/events.out.tfevents.1700606795.76f7abf549a5.47.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:68c80ff677da42b0882f7f88542221c79dd1119cd306c480ec3249d82a4417c1
- size 5937
+ oid sha256:bbe12b67dcb641764b6fb117c5e9246faf7de5f2c41f8d5e296ea697d63856a4
+ size 6562
runs/Nov21_22-45-59_76f7abf549a5/events.out.tfevents.1700606956.76f7abf549a5.47.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0af748d36d5d0eedb79affc288a84a3900ec2c2d541425dfc5a3997d88d705cb
+ size 359