KonradSzafer committed on
Commit 1b6cab1 · 1 Parent(s): 1c76f47

google/flan-t5-small

Files changed (1):
1. README.md +32 -3
README.md CHANGED
@@ -4,9 +4,24 @@ tags:
 - generated_from_trainer
 datasets:
 - samsum
+metrics:
+- rouge
 model-index:
 - name: flan-t5-small-samsum
-  results: []
+  results:
+  - task:
+      name: Sequence-to-sequence Language Modeling
+      type: text2text-generation
+    dataset:
+      name: samsum
+      type: samsum
+      config: samsum
+      split: test
+      args: samsum
+    metrics:
+    - name: Rouge1
+      type: rouge
+      value: 41.8884
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,6 +30,13 @@ should probably proofread and complete it, then remove this comment. -->
 # flan-t5-small-samsum
 
 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the samsum dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.7427
+- Rouge1: 41.8884
+- Rouge2: 17.881
+- Rougel: 34.4405
+- Rougelsum: 38.1283
+- Gen Len: 16.8437
 
 ## Model description
 
@@ -33,13 +55,20 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
+- learning_rate: 1e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 0
+- num_epochs: 1
+
+### Training results
+
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
+| 1.9011        | 1.0   | 32   | 1.7427          | 41.8884 | 17.881 | 34.4405 | 38.1283   | 16.8437 |
+
 
 ### Framework versions
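The updated card describes a flan-t5-small checkpoint fine-tuned for dialogue summarization on samsum. A minimal inference sketch with the transformers summarization pipeline follows; the Hub repo id `KonradSzafer/flan-t5-small-samsum` is an assumption inferred from the committer and model name, and the generation length cap is illustrative rather than taken from the card.

```python
# Minimal inference sketch; the repo id below is an assumption, not part of the card.
from transformers import pipeline

# Assumption: the checkpoint is published as KonradSzafer/flan-t5-small-samsum.
# Substitute a local output_dir or another repo id if the model lives elsewhere.
summarizer = pipeline("summarization", model="KonradSzafer/flan-t5-small-samsum")

dialogue = (
    "Anna: Are we still on for lunch tomorrow?\n"
    "Ben: Yes, 12:30 at the usual place.\n"
    "Anna: Perfect, see you there!"
)

# The card's Gen Len of ~17 tokens suggests short summaries; 60 is just a safe cap here.
result = summarizer(dialogue, max_new_tokens=60)
print(result[0]["summary_text"])
```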
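The hyperparameter list in the diff maps directly onto the Trainer API. Below is a hedged sketch of those values expressed as `Seq2SeqTrainingArguments`; the `output_dir` and `predict_with_generate` choices are placeholders for illustration, not the author's actual training script.

```python
# The hyperparameters listed in the card, expressed as Seq2SeqTrainingArguments.
# Illustrative sketch only; output_dir and the surrounding training script are placeholders.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="flan-t5-small-samsum",  # placeholder; not taken from the card
    learning_rate=1e-5,                 # updated value in this commit (was 5e-05)
    per_device_train_batch_size=16,     # train_batch_size: 16
    per_device_eval_batch_size=16,      # eval_batch_size: 16
    seed=42,
    lr_scheduler_type="linear",
    num_train_epochs=1,                 # updated value in this commit (was 0)
    predict_with_generate=True,         # generate summaries during eval so ROUGE can be computed
)
# Adam with betas=(0.9, 0.999) and epsilon=1e-08 matches the Trainer's default optimizer settings,
# so that line of the card needs no explicit optimizer configuration.
```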
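The new metadata reports Rouge1 on the samsum test split. A rough sketch of how a comparable score could be checked with the `datasets` and `evaluate` libraries is below; the repo id, subset size, and generation settings are assumptions, and the card's own numbers come from the Trainer's evaluation loop rather than this script.

```python
# Rough sketch of scoring the model with ROUGE on the samsum test split.
# Repo id, sample size, and generation settings are assumptions for illustration.
import evaluate
from datasets import load_dataset
from transformers import pipeline

summarizer = pipeline("summarization", model="KonradSzafer/flan-t5-small-samsum")  # assumed repo id
rouge = evaluate.load("rouge")

# samsum ships a loading script; newer datasets versions may require trust_remote_code=True.
test_set = load_dataset("samsum", split="test").select(range(32))  # small subset to keep it quick

predictions = [
    out["summary_text"]
    for out in summarizer(test_set["dialogue"], max_new_tokens=60, batch_size=8)
]
scores = rouge.compute(predictions=predictions, references=test_set["summary"])
# Note: evaluate returns fractions in [0, 1]; the card's values (e.g. 41.8884) are scaled by 100.
print(scores)  # keys include rouge1, rouge2, rougeL, rougeLsum
```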