Open Arabic LLM Leaderboard

non-profit

AI & ML interests

LLM Evaluation

Recent Activity

Hamza-Alobeidli  updated a dataset about 1 hour ago
OALL/requests
amztheory  updated a dataset about 1 hour ago
OALL/requests
alielfilali01  updated a dataset about 1 hour ago
OALL/requests

OALL's activity

alielfilali01 
posted an update 12 days ago
Unpopular opinion: open source takes courage!

Not everyone is brave enough to release what they have done (the way they've done it) into the wild to be judged!
It really requires a high level of "knowing wth you are doing"! It's kind of a superpower!

Cheers to the heroes here who see this!
alielfilali01 
posted an update 16 days ago
Apparently I forgot to post this here!

Well, this is a bit late, but consider giving our recent blog post a read if you are interested in evaluation.

You don't have to be into Arabic NLP to read it: the main contribution is a new evaluation measure for NLG. So far we have applied this measure only to Arabic, and we will be working with colleagues from the community to extend it to other languages.

Blog:
Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard
https://huggingface.co/blog/leaderboard-3c3h-aragen

Space:
inceptionai/AraGen-Leaderboard

Give it a read and let me know your thoughts 🤗