Running
on
CPU Upgrade
159
🥇
MMLU Pro
More advanced and challenging multi-task evaluation
More advanced and challenging multi-task evaluation
View how beam search decoding works, in detail!
Leaderboard for long LLM on In-context Learning