End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.7285
 ## Model description
@@ -41,13 +41,19 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 1000
-- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.3117        | 0.66  | 1000 | 2.7285          |
 ### Framework versions

 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.7754
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 1000
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 3.6372        | 0.66  | 1000 | 2.8398          |
+| 3.0548        | 1.32  | 2000 | 2.7754          |
+| 3.0124        | 1.99  | 3000 | 2.7832          |
+| 3.0056        | 2.65  | 4000 | 2.8105          |
+| 2.9911        | 3.31  | 5000 | 2.7812          |
+| 2.983         | 3.97  | 6000 | 2.7676          |
+| 2.9735        | 4.64  | 7000 | 2.7754          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:aae7b1c77a407bcafeee2fc69e48abddb4dc0247d43fd0e94910d3060183088a
 size 418013

 version https://git-lfs.github.com/spec/v1
+oid sha256:deab2a915f6881025fbc99c13f13b338d94cc8248f0076f261b80f3193525820
 size 418013

tokenizer.json CHANGED Viewed

@@ -1,7 +1,19 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
   "added_tokens": [
     {
       "id": 50256,

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 1024,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
+  "padding": {
+    "strategy": "BatchLongest",
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 50256,
+    "pad_type_id": 0,
+    "pad_token": "<|endoftext|>"
+  },
   "added_tokens": [
     {
       "id": 50256,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:506f6e20cd8c3db2c7ae9c5c31139cf0ee81ce8f7aa587ee77d824b22310b13b
 size 4091

 version https://git-lfs.github.com/spec/v1
+oid sha256:830f091b7b9ed63b66cc6d35c8205ab488e9c026944bd3b61de3c7d3896f62cb
 size 4091