Running 539 539 Vision Arena (Testing VLMs side-by-side) πΌ Analyze images to detect and label objects
Running on CPU Upgrade 67 67 La Leaderboard πΈ Evaluate open LLMs in the languages of LATAM and Spain.
Evaluation Datasets Collection Basque Evaluation Datasets β’ 23 items β’ Updated Aug 31, 2024 β’ 1
Latxa Collection Latxa: An Open Language Model and Evaluation Suite for Basque β’ 10 items β’ Updated Jul 2, 2024 β’ 16
BertaQA: How Much Do Language Models Know About Local Culture? Paper β’ 2406.07302 β’ Published Jun 11, 2024