Open LLM Leaderboard
community
AI & ML interests
Evaluating open LLMs
Organization Card
Open LLM Leaderboard
This is the hub organisation maintaining the Open LLM Leaderboard.
In this organisation you will find the datasets with detailed results and queries for the models on the leaderboard.
Score results are in open-llm-leaderboard/results, and the current state of requests is in open-llm-leaderboard/requests. For detailed predictions, look for your model's name among the datasets below!
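The per-model datasets listed on this page appear to follow the pattern `open-llm-leaderboard/<org>__<model>-details` (for example `open-llm-leaderboard/bfuzzy1__acheron-details`). The sketch below builds that name from a model id; the helper `details_repo_id` is hypothetical, and the naming rule is inferred from the listing rather than from any documented guarantee.

```python
# Assumption: details datasets are named "<org>__<model>-details" under the
# open-llm-leaderboard organisation, as the dataset listing suggests.
LEADERBOARD_ORG = "open-llm-leaderboard"


def details_repo_id(model_id: str) -> str:
    """Map a Hub model id like 'org/model' to its presumed details dataset id."""
    org, sep, name = model_id.partition("/")
    # Require exactly one '/' with a non-empty organisation and model name.
    if not sep or not org or not name or "/" in name:
        raise ValueError(f"expected '<org>/<model>', got {model_id!r}")
    return f"{LEADERBOARD_ORG}/{org}__{name}-details"


if __name__ == "__main__":
    print(details_repo_id("bfuzzy1/acheron"))
```

If the resulting name does not exist on the hub, the model may not have been evaluated yet, or the naming rule may differ for that entry.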
Collections
2
Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard)
spaces
5
pinned
Running on CPU Upgrade • 12.1k
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Running • 74
Open LLM Leaderboard Model Comparator
Compare Open LLM Leaderboard results
Running • 105
Open-LLM performances are plateauing, let’s make the leaderboard steep again
Running • 6
Exploring model generations
Runtime error • 1
Sample Viewer
models
None public yet
datasets
2368
open-llm-leaderboard/contents
Viewer • Updated • 2.41k • 13.2k • 5
open-llm-leaderboard/bfuzzy1__acheron-details
Updated
open-llm-leaderboard/requests
Preview • Updated • 104k • 9
open-llm-leaderboard/results
Preview • Updated • 34.9k • 7
open-llm-leaderboard/Minami-su__test-7B-01-details
Viewer • Updated • 43.2k • 1
open-llm-leaderboard/ContactDoctor__Bio-Medical-Llama-3-8B-details
Viewer • Updated • 43.2k • 2
open-llm-leaderboard/sometimesanotion__Qwen-14B-ProseStock-v4-details
Viewer • Updated • 43.2k • 2
open-llm-leaderboard/Sakalti__ultiima-32B-details
Updated • 2
open-llm-leaderboard/qingy2024__QwQ-14B-Math-v0.2-details
Viewer • Updated • 43.2k • 2
open-llm-leaderboard/Minami-su__test-7B-00-details
Viewer • Updated • 43.2k • 3