SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Paper: arXiv:2406.10118
SEACrowd is a community-driven project that aims to centralize and standardize AI resources for Southeast Asian languages, cultures, and/or regions.
Note: Our paper.
Note: Our fine-tuned SEA translationese classifier, based on Microsoft's mDeBERTa model.
Note: Our translationese vs. natural train/test data.