Invalid LLM Leaderboard results
#1
by
cmp-nct
- opened
This model received a math score of 0%, I've checked the resultset and this is not due to wrong answers. The answers I checked all were correct.
The leaderboard was not capable to parse the formating (boxed() ) answers. This model would score significantly higher with proper leaderboard results.