josedanielaromi/distilgpt2-finetuned-wikitext2

josedanielaromi committed on Sep 22, 2023

Commit 4c9f24b • 1 Parent(s): 6c981d1

End of training

Files changed (3)

README.md +12 -5
pytorch_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.6665
+- Loss: 3.2676
 ## Model description
@@ -40,15 +40,22 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3.0
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 24   | 3.7376          |
-| No log        | 2.0   | 48   | 3.6848          |
-| No log        | 3.0   | 72   | 3.6665          |
+| No log        | 1.0   | 28   | 3.7190          |
+| No log        | 2.0   | 56   | 3.5623          |
+| No log        | 3.0   | 84   | 3.4554          |
+| No log        | 4.0   | 112  | 3.3877          |
+| No log        | 5.0   | 140  | 3.3390          |
+| No log        | 6.0   | 168  | 3.3063          |
+| No log        | 7.0   | 196  | 3.2878          |
+| No log        | 8.0   | 224  | 3.2761          |
+| No log        | 9.0   | 252  | 3.2698          |
+| No log        | 10.0  | 280  | 3.2676          |
 ### Framework versions
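
The validation losses in the README diff are easier to compare as perplexities. The sketch below assumes the reported loss is a mean per-token cross-entropy in nats, which is the usual case for a causal language model trained with the Trainer but is not stated in the card. The `perplexity` helper and the `val_loss_*` names are illustrative and not part of this repository.

```python
import math

# Validation losses per epoch, copied from the removed and added table rows above.
val_loss_before = {1: 3.7376, 2: 3.6848, 3: 3.6665}
val_loss_after = {
    1: 3.7190, 2: 3.5623, 3: 3.4554, 4: 3.3877, 5: 3.3390,
    6: 3.3063, 7: 3.2878, 8: 3.2761, 9: 3.2698, 10: 3.2676,
}


def perplexity(loss: float) -> float:
    """Convert a mean per-token cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


if __name__ == "__main__":
    for epoch, loss in val_loss_after.items():
        print(f"epoch {epoch:2d}  loss {loss:.4f}  perplexity {perplexity(loss):.2f}")
```

Under that assumption, the drop in final loss from 3.6665 to 3.2676 corresponds to a perplexity of roughly 39 falling to roughly 26. The per-epoch improvement also shrinks steadily toward epoch 10, which suggests the run was approaching a plateau.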

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a90ad3d2c6ced5c89a596345a852d6c11a8190b6122e759843c960e7177cf5f3
+oid sha256:77eddf4f8e6ae2ced886ef14e9e828db4df1199f1c46a22370b01d7c589d3a22
 size 327673749

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:194cf7fd34e7a4e8f0a2a27f865c3a1161f20cd553f4bafd216d2cea52fa2071
+oid sha256:15745e02dea6b6e305568183af208e6d7c57f3705fde17ed1d503a34ca6d774a
 size 4027
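
The two binary files are stored as Git LFS pointers, so the diff shows only the pointer text (`version`, `oid`, `size`) rather than the file contents. As a minimal sketch, the pointer lines can be parsed and compared like this. `parse_lfs_pointer` is a hypothetical helper written for illustration, and the two strings are the old and new `pytorch_model.bin` pointers from this commit.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:a90ad3d2c6ced5c89a596345a852d6c11a8190b6122e759843c960e7177cf5f3
size 327673749
"""

NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:77eddf4f8e6ae2ced886ef14e9e828db4df1199f1c46a22370b01d7c589d3a22
size 327673749
"""

if __name__ == "__main__":
    old, new = parse_lfs_pointer(OLD_POINTER), parse_lfs_pointer(NEW_POINTER)
    print("content changed:", old["digest"] != new["digest"])
    print("size unchanged: ", old["size"] == new["size"])
```

A changed digest with an identical byte size is what one would expect when the same architecture is saved with different weights, although the pointer alone cannot confirm that.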