Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,49 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
datasets:
|
4 |
+
- SZTAKI-HLT/HunSum-2-abstractive
|
5 |
+
language:
|
6 |
+
- hu
|
7 |
+
metrics:
|
8 |
+
- rouge
|
9 |
+
pipeline_tag: summarization
|
10 |
+
inference:
|
11 |
+
parameters:
|
12 |
+
num_beams: 5
|
13 |
+
length_penalty: 2
|
14 |
+
max_length: 128
|
15 |
+
encoder_no_repeat_ngram_size: 4
|
16 |
+
no_repeat_ngram_size: 3
|
17 |
+
---
|
18 |
+
|
19 |
+
# Model Card for mT5-base-HunSum-2
|
20 |
+
|
21 |
+
The mT5-base-HunSum-1 is a Hungarian abstractive summarization model, which was trained on the [SZTAKI-HLT/HunSum-2-abstractive dataset](https://huggingface.co/datasets/SZTAKI-HLT/HunSum-2-abstractive).
|
22 |
+
The model is based on [google/mt5-base](https://huggingface.co/google/mt5-base).
|
23 |
+
|
24 |
+
## Intended uses & limitations
|
25 |
+
|
26 |
+
- **Model type:** Text Summarization
|
27 |
+
- **Language(s) (NLP):** Hungarian
|
28 |
+
- **Resource(s) for more information:**
|
29 |
+
- [GitHub Repo](https://github.com/botondbarta/summarization)
|
30 |
+
|
31 |
+
## Parameters
|
32 |
+
|
33 |
+
- **Batch Size:** 12
|
34 |
+
- **Learning Rate:** 5e-5
|
35 |
+
- **Weight Decay:** 0.01
|
36 |
+
- **Warmup Steps:** 3000
|
37 |
+
- **Epochs:** 10
|
38 |
+
- **no_repeat_ngram_size:** 3
|
39 |
+
- **num_beams:** 5
|
40 |
+
- **early_stopping:** False
|
41 |
+
- **encoder_no_repeat_ngram_size:** 4
|
42 |
+
|
43 |
+
## Results
|
44 |
+
|
45 |
+
| Metric | Value |
|
46 |
+
| :------------ | :------------------------------------------ |
|
47 |
+
| ROUGE-1 | 40.06 |
|
48 |
+
| ROUGE-2 | 12.67 |
|
49 |
+
| ROUGE-L | 25.93 |
|