End of training

Files changed (3)

README.md CHANGED

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.9243
+- Loss: 3.9296
 ## Model description
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 282  | 3.9230          |
-| 3.9671        | 2.0   | 564  | 3.9234          |
-| 3.9671        | 3.0   | 846  | 3.9243          |
+| No log        | 1.0   | 270  | 3.9288          |
+| 3.9605        | 2.0   | 540  | 3.9270          |
+| 3.9605        | 3.0   | 810  | 3.9296          |
 ### Framework versions
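
A note on reading the loss change in this hunk: for a causal language model, evaluation loss is commonly converted to perplexity by exponentiating it. The sketch below assumes the reported loss is a mean per-token cross-entropy in nats, which the README does not state; the `perplexity` helper is a hypothetical name used only for illustration.

```python
import math


def perplexity(mean_cross_entropy: float) -> float:
    """Convert a mean per-token cross-entropy (assumed to be in nats) to perplexity."""
    return math.exp(mean_cross_entropy)


# Final evaluation losses taken from the old and new sides of the README diff.
old_loss = 3.9243
new_loss = 3.9296

old_ppl = perplexity(old_loss)
new_ppl = perplexity(new_loss)

# The ratio depends only on the loss difference: exp(new_loss - old_loss).
ratio = new_ppl / old_ppl

print(f"old perplexity: {old_ppl:.2f}")
print(f"new perplexity: {new_ppl:.2f}")
print(f"ratio new/old:  {ratio:.4f}")
```

Under that assumption, the two runs differ by well under one perplexity point, so the change is small.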

runs/Sep11_00-02-26_a4bf01945d25.jf.intel.com/events.out.tfevents.1726038150.a4bf01945d25.jf.intel.com.433654.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b7bcac06a0a6e5c9e1e4c94e6654f00f40839d0c08bbc26acfad4c875daba62c
-size 6091
+oid sha256:58053ba178baf4790fff68be2ffcd2190ceb28a964293695fba7604bbf3afd91
+size 6716

runs/Sep11_00-02-26_a4bf01945d25.jf.intel.com/events.out.tfevents.1726038218.a4bf01945d25.jf.intel.com.433654.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:31f01e41349185fdb5cf5028d133241fc4b161e9c121e07ac2b7fb1c6a8c79db
+size 359
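
The event files in this commit are stored as Git LFS pointers, which are small text files of `key value` lines rather than the binary data itself. A minimal sketch of reading one follows; `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling, and it handles only the three keys that appear in this diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is a key, one space, then the value.
        key, _, value = line.partition(" ")
        fields[key] = value

    # The oid value has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real object in bytes
    }


# Pointer content of the file added in this commit.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:31f01e41349185fdb5cf5028d133241fc4b161e9c121e07ac2b7fb1c6a8c79db
size 359
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```

The `size` field refers to the stored object, not the pointer, so the change from 6091 to 6716 in the first event file suggests more logged events, while the pointer text itself stays three lines long.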