mNLP-project
/

distilgpt2-finetuned

@@ -3,26 +3,11 @@ license: apache-2.0
 base_model: distilbert/distilgpt2
 tags:
 - generated_from_trainer
-datasets:
-- eli5_category
 metrics:
 - bleu
 model-index:
 - name: distilgpt2-finetuned
-  results:
-  - task:
-      name: Causal Language Modeling
-      type: text-generation
-    dataset:
-      name: eli5_category
-      type: eli5_category
-      config: default
-      split: None
-      args: default
-    metrics:
-    - name: Bleu
-      type: bleu
-      value: 0.010587533155110318
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,13 +15,13 @@ should probably proofread and complete it, then remove this comment. -->
 # distilgpt2-finetuned
-This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.7703
-- Bleu: 0.0106
-- Bertscore Precision: 0.1609
-- Bertscore Recall: 0.1758
-- Bertscore F1: 0.1677
 ## Model description
@@ -66,11 +51,11 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Bleu   | Bertscore Precision | Bertscore Recall | Bertscore F1 |
-|:-------------:|:-----:|:-----:|:---------------:|:------:|:-------------------:|:----------------:|:------------:|
-| 3.8816        | 1.0   | 4000  | 3.7775          | 0.0107 | 0.1607              | 0.1756           | 0.1675       |
-| 3.7273        | 2.0   | 8000  | 3.7660          | 0.0107 | 0.1608              | 0.1757           | 0.1676       |
-| 3.6125        | 3.0   | 12000 | 3.7703          | 0.0106 | 0.1609              | 0.1758           | 0.1677       |
 ### Framework versions

 base_model: distilbert/distilgpt2
 tags:
 - generated_from_trainer
 metrics:
 - bleu
 model-index:
 - name: distilgpt2-finetuned
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # distilgpt2-finetuned
+This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.8114
+- Bleu: 0.0101
+- Bertscore Precision: 0.1499
+- Bertscore Recall: 0.1656
+- Bertscore F1: 0.1571
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Bleu   | Bertscore Precision | Bertscore Recall | Bertscore F1 |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:-------------------:|:----------------:|:------------:|
+| 4.1924        | 1.0   | 644  | 4.0681          | 0.0091 | 0.1493              | 0.1649           | 0.1564       |
+| 4.0754        | 2.0   | 1288 | 3.8779          | 0.0099 | 0.1498              | 0.1654           | 0.1569       |
+| 3.8277        | 3.0   | 1932 | 3.8114          | 0.0101 | 0.1499              | 0.1656           | 0.1571       |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:41d6a81d2957121e25266b5deefdd4d465e947761e64aec5d04f68c22f7a9651
 size 327657928

 version https://git-lfs.github.com/spec/v1
+oid sha256:f0c79c17a1750921715b59469a5381f7cc25934155f493fc482e492564bab7c1
 size 327657928

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cc320c476cfad9c910c6c99a784234e4e5e9b59b1cd6298c0267335ab023f2d3
 size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:65336c76c05ee6d71702e751ad04c2c9c8e62744efbcd520c27e8c322042bbd3
 size 5048