Bloof commited on
Commit
fddf5d2
1 Parent(s): 083c0d9

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 3.7180
19
 
20
  ## Model description
21
 
@@ -40,15 +40,20 @@ The following hyperparameters were used during training:
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - num_epochs: 3.0
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | 4.4264 | 1.0 | 852 | 3.8693 |
50
- | 4.0268 | 2.0 | 1704 | 3.7511 |
51
- | 3.9332 | 3.0 | 2556 | 3.7180 |
 
 
 
 
 
52
 
53
 
54
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 3.3031
19
 
20
  ## Model description
21
 
 
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
+ - num_epochs: 8
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | 3.8899 | 1.0 | 852 | 3.5930 |
50
+ | 3.7515 | 2.0 | 1704 | 3.4929 |
51
+ | 3.6564 | 3.0 | 2556 | 3.4250 |
52
+ | 3.6221 | 4.0 | 3408 | 3.3787 |
53
+ | 3.5517 | 5.0 | 4260 | 3.3441 |
54
+ | 3.5009 | 6.0 | 5112 | 3.3200 |
55
+ | 3.5012 | 7.0 | 5964 | 3.3080 |
56
+ | 3.4741 | 8.0 | 6816 | 3.3031 |
57
 
58
 
59
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:066675d294b3bfbb1c13e3fba9a70f1a782d56937aa8bc401ea0d1418c053677
3
  size 327657928
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bae0bdc2b503d19953ead2873f89b65eeda2c0372b00050bc660ee8a43872885
3
  size 327657928
runs/May09_04-42-31_af5bf23131d4/events.out.tfevents.1715229757.af5bf23131d4.900.3 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dd0398055a306bfb6df6d8827836f19c37e1626e3a072e54273a9c05f6ec9ec5
3
- size 9717
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:04fed704bd43cb835d17dd8afaae2380538d02309948604859a31e943ed5fe7b
3
+ size 10342
runs/May09_04-42-31_af5bf23131d4/events.out.tfevents.1715231474.af5bf23131d4.900.4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:87d3dcd191cca03baf8a5e3edd04ff2ece9adce30483b29adfdfc99726c214fa
3
+ size 359