---
language:
- ms
tags:
- paraphrase
metrics:
- sacrebleu
---

# finetune-paraphrase-t5-tiny-standard-bahasa-cased

T5 tiny fine-tuned for Malay (MS) paraphrasing.

## Dataset

1. Translated PAWS, https://huggingface.co/datasets/mesolitica/translated-PAWS
2. Translated MRPC, https://huggingface.co/datasets/mesolitica/translated-MRPC

## Finetune details

1. Fine-tuned on a single RTX 3090 Ti.

Scripts are at https://github.com/huseinzol05/malaya/tree/master/session/paraphrase/hf-t5

## Supported prefix

1. `parafrasa: {string}`, for Malay (MS) paraphrasing.

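A minimal usage sketch for the `parafrasa:` prefix, using the standard `transformers` T5 classes. The Hub ID in `MODEL_ID` is an assumption built from the card title and the `mesolitica` organisation seen in the dataset links. The helper names `build_prompt` and `paraphrase` and the `max_length` default are illustrative, not taken from the original scripts.

```python
# Assumed Hub ID (card title + "mesolitica" org); adjust if the checkpoint lives elsewhere.
MODEL_ID = "mesolitica/finetune-paraphrase-t5-tiny-standard-bahasa-cased"
PREFIX = "parafrasa: "


def build_prompt(text: str) -> str:
    """Prepend the task prefix this model card lists as supported."""
    return f"{PREFIX}{text.strip()}"


def paraphrase(text: str, max_length: int = 256) -> str:
    """Generate one paraphrase (hypothetical helper; downloads the model on first call)."""
    # Imported lazily so build_prompt stays usable without transformers installed.
    from transformers import AutoTokenizer, T5ForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = T5ForConditionalGeneration.from_pretrained(MODEL_ID)

    inputs = tokenizer(build_prompt(text), return_tensors="pt")
    output_ids = model.generate(**inputs, max_length=max_length)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `paraphrase("Saya suka makan nasi goreng.")` should return a single Malay paraphrase; decoding settings such as beam search or sampling can be passed through `generate` as needed.
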
## Evaluation

Evaluated on the MRPC validation set and the PAWS test set.

```
{'name': 'BLEU',
 'score': 61.06784273649806,
 '_mean': -1.0,
 '_ci': -1.0,
 '_verbose': '86.1/68.4/55.8/45.9 (BP = 0.980 ratio = 0.980 hyp_len = 138209 ref_len = 141004)',
 'bp': 0.9799801176769202,
 'counts': [119035, 89737, 69210, 53653],
 'totals': [138209, 131135, 124061, 116987],
 'sys_len': 138209,
 'ref_len': 141004,
 'precisions': [86.1268079502782,
  68.4310062149693,
  55.787072488533866,
  45.86236077512886],
 'prec_str': '86.1/68.4/55.8/45.9',
 'ratio': 0.9801778672945448}
```
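As a sanity check on how the fields above relate, the sketch below recomputes the corpus BLEU score from the reported `counts`, `totals`, `sys_len`, and `ref_len` using the standard BLEU definition (brevity penalty times the geometric mean of the n-gram precisions). The function name `bleu_from_stats` is illustrative. It assumes no smoothing is needed, which holds here because every n-gram count is non-zero.

```python
import math

# Statistics copied from the evaluation output above.
counts = [119035, 89737, 69210, 53653]      # matched 1- to 4-grams
totals = [138209, 131135, 124061, 116987]   # hypothesis 1- to 4-grams
sys_len, ref_len = 138209, 141004


def bleu_from_stats(counts, totals, sys_len, ref_len):
    """Return (BLEU on a 0-100 scale, brevity penalty) from corpus-level n-gram stats."""
    precisions = [c / t for c, t in zip(counts, totals)]
    # Brevity penalty only applies when the hypothesis is shorter than the reference.
    bp = 1.0 if sys_len >= ref_len else math.exp(1 - ref_len / sys_len)
    # Geometric mean of the n-gram precisions, computed in log space.
    geo_mean = math.exp(sum(math.log(p) for p in precisions) / len(precisions))
    return 100 * bp * geo_mean, bp


bleu, bp = bleu_from_stats(counts, totals, sys_len, ref_len)
print(f"BLEU = {bleu:.2f}, BP = {bp:.3f}, ratio = {sys_len / ref_len:.3f}")
```

The brevity penalty below 1 reflects that the generated paraphrases are about 2% shorter in total than the references (`ratio` of roughly 0.980).
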