End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.8235
 ## Model description
@@ -48,9 +48,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.946         | 1.0   | 1323 | 3.8340          |
-| 3.8542        | 2.0   | 2646 | 3.8252          |
-| 3.8166        | 3.0   | 3969 | 3.8235          |
 ### Framework versions

 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.8206
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 3.9493        | 1.0   | 1316 | 3.8335          |
+| 3.8627        | 2.0   | 2632 | 3.8231          |
+| 3.823         | 3.0   | 3948 | 3.8206          |
 ### Framework versions

runs/Aug28_07-13-07_56791c3d18e1/events.out.tfevents.1724829189.56791c3d18e1.7951.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:099bbc7069b4dff60de45c318b193ceb1434d150de3b360a5641fb4be48cfea2
-size 7227

 version https://git-lfs.github.com/spec/v1
+oid sha256:dc36add2011a4185d65e5b397feb2e8d1748ca7c086ce90b4cea44b13cfc696b
+size 7852

runs/Aug28_07-13-07_56791c3d18e1/events.out.tfevents.1724830579.56791c3d18e1.7951.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:c51a944f8e7a03da11399aa6a0b9b3e6f92ebac77d4209e73e507b69cee705a1
+size 359