thymiantheherb commited on
Commit
32929de
1 Parent(s): 44fcc4b

End of training

Browse files
Files changed (3) hide show
  1. README.md +12 -27
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -3,26 +3,11 @@ license: apache-2.0
3
  base_model: distilbert/distilgpt2
4
  tags:
5
  - generated_from_trainer
6
- datasets:
7
- - eli5_category
8
  metrics:
9
  - bleu
10
  model-index:
11
  - name: distilgpt2-finetuned
12
- results:
13
- - task:
14
- name: Causal Language Modeling
15
- type: text-generation
16
- dataset:
17
- name: eli5_category
18
- type: eli5_category
19
- config: default
20
- split: None
21
- args: default
22
- metrics:
23
- - name: Bleu
24
- type: bleu
25
- value: 0.010587533155110318
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,13 +15,13 @@ should probably proofread and complete it, then remove this comment. -->
30
 
31
  # distilgpt2-finetuned
32
 
33
- This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 3.7703
36
- - Bleu: 0.0106
37
- - Bertscore Precision: 0.1609
38
- - Bertscore Recall: 0.1758
39
- - Bertscore F1: 0.1677
40
 
41
  ## Model description
42
 
@@ -66,11 +51,11 @@ The following hyperparameters were used during training:
66
 
67
  ### Training results
68
 
69
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Bertscore Precision | Bertscore Recall | Bertscore F1 |
70
- |:-------------:|:-----:|:-----:|:---------------:|:------:|:-------------------:|:----------------:|:------------:|
71
- | 3.8816 | 1.0 | 4000 | 3.7775 | 0.0107 | 0.1607 | 0.1756 | 0.1675 |
72
- | 3.7273 | 2.0 | 8000 | 3.7660 | 0.0107 | 0.1608 | 0.1757 | 0.1676 |
73
- | 3.6125 | 3.0 | 12000 | 3.7703 | 0.0106 | 0.1609 | 0.1758 | 0.1677 |
74
 
75
 
76
  ### Framework versions
 
3
  base_model: distilbert/distilgpt2
4
  tags:
5
  - generated_from_trainer
 
 
6
  metrics:
7
  - bleu
8
  model-index:
9
  - name: distilgpt2-finetuned
10
+ results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
15
 
16
  # distilgpt2-finetuned
17
 
18
+ This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 3.8114
21
+ - Bleu: 0.0101
22
+ - Bertscore Precision: 0.1499
23
+ - Bertscore Recall: 0.1656
24
+ - Bertscore F1: 0.1571
25
 
26
  ## Model description
27
 
 
51
 
52
  ### Training results
53
 
54
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Bertscore Precision | Bertscore Recall | Bertscore F1 |
55
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:-------------------:|:----------------:|:------------:|
56
+ | 4.1924 | 1.0 | 644 | 4.0681 | 0.0091 | 0.1493 | 0.1649 | 0.1564 |
57
+ | 4.0754 | 2.0 | 1288 | 3.8779 | 0.0099 | 0.1498 | 0.1654 | 0.1569 |
58
+ | 3.8277 | 3.0 | 1932 | 3.8114 | 0.0101 | 0.1499 | 0.1656 | 0.1571 |
59
 
60
 
61
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:41d6a81d2957121e25266b5deefdd4d465e947761e64aec5d04f68c22f7a9651
3
  size 327657928
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f0c79c17a1750921715b59469a5381f7cc25934155f493fc482e492564bab7c1
3
  size 327657928
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cc320c476cfad9c910c6c99a784234e4e5e9b59b1cd6298c0267335ab023f2d3
3
  size 5048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:65336c76c05ee6d71702e751ad04c2c9c8e62744efbcd520c27e8c322042bbd3
3
  size 5048