Update README.md
README.md
CHANGED
@@ -60,13 +60,10 @@ The model was trained on a combination of the following datasets:
 | WikiMatrix | 358.873 | 317.649 |
 | GNOME | 5.211 | 1.752|
 | KDE4 | 166.208 | 117.828 |
-| QED | 53.635 | 43.736 |
-| TED2020 v1 | 48.942 | 41.461 |
 | OpenSubtitles | 384.142 | 235.604 |
 | GlobalVoices| 4.035 | 3.430|
 | Tatoeba | 754 | 723 |
 | Europarl | 1.692.106 | 1.631.989 |
-| **Total** | **15.391.745** | **6.159.631** |

 All corpora except Europarl were collected from [Opus](https://opus.nlpl.eu/).
 The Europarl corpus is a synthetic parallel corpus created from the original Spanish-Catalan corpus by [SoftCatalà](https://github.com/Softcatala/Europarl-catalan).