AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages Paper • 2501.08284 • Published 10 days ago • 6
Building Foundations for Natural Language Processing of Historical Turkish: Resources and Models Paper • 2501.04828 • Published 16 days ago • 11
Post: 🇸🇰 Hovorte po slovensky? Help build better AI for Slovak! We only need 90 more annotations to include Slovak in the next Hugging Face FineWeb2-C dataset (data-is-better-together/fineweb-c) release! Your contribution will help create better language models for 5+ million Slovak speakers. Annotate here: data-is-better-together/fineweb-c. Read more about why we're doing it: https://huggingface.co/blog/davanstrien/fineweb2-community
U-MATH and μ-MATH - University-level math evaluation Collection Paper: A UNIVERSITY-LEVEL BENCHMARK FOR EVALUATING MATHEMATICAL SKILLS IN LLMS • 4 items • Updated 11 days ago • 15