faaany committed on
Commit 320d8c9
1 Parent(s): a741dff

End of training

README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.8341
+- Loss: 3.9243
 
 ## Model description
 
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.952 | 1.0 | 1321 | 3.8483 |
-| 3.8429 | 2.0 | 2642 | 3.8352 |
-| 3.811 | 3.0 | 3963 | 3.8341 |
+| No log | 1.0 | 282 | 3.9230 |
+| 3.9671 | 2.0 | 564 | 3.9234 |
+| 3.9671 | 3.0 | 846 | 3.9243 |
 
 
 ### Framework versions
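
The evaluation loss changed in README.md can be read as a perplexity, which is often easier to compare between runs of a causal language model. The sketch below assumes the reported loss is the mean per-token cross-entropy in nats; the diff itself does not state this. The helper name `perplexity` is illustrative and not part of the commit.

```python
import math


def perplexity(cross_entropy_loss: float) -> float:
    """Convert a mean per-token cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(cross_entropy_loss)


# Evaluation losses taken from the README.md diff above.
old_loss = 3.8341
new_loss = 3.9243

print(f"before this commit: {perplexity(old_loss):.2f}")
print(f"after this commit:  {perplexity(new_loss):.2f}")
```

A higher loss always maps to a higher perplexity, so the ordering of the two runs is the same under either metric.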
runs/Sep10_23-51-17_a4bf01945d25.jf.intel.com/events.out.tfevents.1726037481.a4bf01945d25.jf.intel.com.424034.0 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f44fbae4b4c36b1c572f5f0bb66d4ce4503b74fb55275c5ec1fcf2efbd6c5824
-size 6091
+oid sha256:4837150f238fde9f26e19512c9e74dda64eedde620f0b51a3b6a01fa6e37927a
+size 6716
runs/Sep10_23-51-17_a4bf01945d25.jf.intel.com/events.out.tfevents.1726037537.a4bf01945d25.jf.intel.com.424034.1 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e909473ece3f2c0a43d6cad60287fb1711b4229ea5eeff6759198902c542dd19
+size 359
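
The two event files in this commit are stored as Git LFS pointers: short text files made of `key value` lines giving the spec version, the object hash, and the byte size of the real file. As a minimal sketch, the pointer added above could be parsed as follows. The function name `parse_lfs_pointer` and the returned dictionary layout are illustrative assumptions, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file into a dictionary."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer contents of the newly added event file, copied from the diff above.
pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e909473ece3f2c0a43d6cad60287fb1711b4229ea5eeff6759198902c542dd19
size 359
"""

info = parse_lfs_pointer(pointer)
print(info["oid_algo"], info["size"])
```

Only the pointer text is versioned in the repository; the diff therefore shows a changed hash and size rather than the binary TensorBoard event data.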