Running on CPU Upgrade 12.4k 12.4k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI Paper • 2406.12753 • Published Jun 18, 2024 • 14