## Training configuration
We trained `google/umt5-small` [300 million parameters (~1.20 GB)] on the GreekSUM train split using the following parameters:
* GPU batch size = 6
* Total training epochs = 10
* AdamW optimizer (ε = 1e−8, β1 = 0.9 and β2 = 0.999)
* padding = 'max_length'
* truncation = True
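
Under these settings, the fine-tuning setup could be sketched with the Hugging Face `transformers` library roughly as below. This is a sketch, not the repository's actual training script: the output directory name, the `max_length` value, and the `article`/`summary` column names are assumptions, while the hyperparameters mirror the bullet list above (with β2 as the standard AdamW default).

```python
from transformers import Seq2SeqTrainingArguments

# Training arguments mirroring the bullets above (sketch; output_dir is hypothetical).
args = Seq2SeqTrainingArguments(
    output_dir="umt5-small-greeksum",
    per_device_train_batch_size=6,
    num_train_epochs=10,
    adam_beta1=0.9,
    adam_beta2=0.999,   # assumed standard AdamW default
    adam_epsilon=1e-8,
)

# Tokenization mirroring the padding/truncation bullets.
# Column names and max_length are assumptions, not taken from this card.
def preprocess(batch, tokenizer, max_length=512):
    model_inputs = tokenizer(
        batch["article"],
        padding="max_length",
        truncation=True,
        max_length=max_length,
    )
    labels = tokenizer(
        batch["summary"],
        padding="max_length",
        truncation=True,
        max_length=max_length,
    )
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs
```

A `Seq2SeqTrainer` would then be constructed from `args`, the model, and the tokenized GreekSUM splits.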

**Note:** Since T5-based models use a multi-task, text-to-text architecture, the prefix *'summarize: '* was prepended to each training sample.
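
Concretely, the prefixing step can be sketched as a plain preprocessing function (the `article` column name is an assumption, not taken from this card):

```python
PREFIX = "summarize: "

def add_prefix(batch):
    """Prepend the T5-style task prefix to every source text in a batch."""
    batch["article"] = [PREFIX + text for text in batch["article"]]
    return batch

# Toy batch with two Greek source texts.
sample = {"article": ["Το πρώτο άρθρο.", "Το δεύτερο άρθρο."]}
out = add_prefix(sample)
# Each source text now starts with "summarize: ".
```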
## Evaluation
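
As a reference for what the table below reports, ROUGE-1 F1 can be sketched in plain Python. Real evaluations use a proper ROUGE implementation with its own tokenization and (for BERTScore) a contextual-embedding model; this toy version only counts overlapping unigrams.

```python
from collections import Counter

def rouge1_f1(candidate: str, reference: str) -> float:
    """Toy ROUGE-1 F1: unigram overlap between candidate and reference."""
    cand = Counter(candidate.split())
    ref = Counter(reference.split())
    overlap = sum((cand & ref).values())  # clipped unigram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)

score = rouge1_f1("the cat sat", "the cat sat down")  # ≈ 0.857
```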
**Approach**|**ROUGE-1**|**ROUGE-2**|**ROUGE-L**|**BERTScore**