File size: 768 Bytes
8e99cfc
 
9d03ff9
8e99cfc
 
 
 
3e9fdaf
 
865ebcd
 
 
8e99cfc
 
 
 
 
865ebcd
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
---
language: gn
license: mit
datasets:
- wikipedia
- wiktionary
widget:
- text: "Paraguay ha'e peteĩ táva oĩva [MASK] retãme"
- text: "Augusto Roa Bastos ha'e peteĩ [MASK] arandu"
metrics:
- f1
- accuracy
---

# BERT-i-base-cased (gnBERT-base-cased)

A pre-trained BERT model for **Guarani** (12 layers, cased). Trained on Wikipedia + Wiktionary (~800K tokens).

# How cite?

```
@article{aguero-et-al2023multi-affect-low-langs-grn,
  title={Multidimensional Affective Analysis for Low-resource Languages: A Use Case with Guarani-Spanish Code-switching Language},
  author={Agüero-Torales, Marvin Matías, López-Herrera, Antonio Gabriel, and Vilares, David},
  journal={Cognitive Computation},
  year={2023},
  publisher={Springer},
  notes={Forthcoming}
}
```