Spaces:

kevinpro
/

Open-Multilingual-Reasoning-Leaderboard

Running

kevinpro commited on Mar 11, 2024

Commit

b220808

1 Parent(s): 562362e

commit message

Files changed (2) hide show

__pycache__/content.cpython-38.pyc CHANGED Viewed

Binary files a/__pycache__/content.cpython-38.pyc and b/__pycache__/content.cpython-38.pyc differ

content.py CHANGED Viewed

@@ -3,25 +3,24 @@ TITLE = '<h1 align="center" id="space-title">Open Multilingual Reasoning Leaderb
 INTRO_TEXT = f"""
 ## About
-This leaderboard tracks progress and ranks reasoning performance of large language models (LLMs) developed for different languages,
-emphasizing on non-English languages to democratize benefits of LLMs to broader society.
-Our current leaderboard provides evaluation data for 10 languages.
 Both multilingual and language-specific LLMs are welcome in this leaderboard.
-We currently evaluate models over four benchmarks:
 - <a href="https://huggingface.co/datasets/Mathoctopus/MSVAMP" target="_blank">  MSVAMP </a>
 - <a href="https://huggingface.co/datasets/juletxara/mgsm" target="_blank">  MGSM </a>
-- <a href="https://arxiv.org/abs/2009.03300" target="_blank">  MNumGLUESub </a>
-# """
-# HOW_TO = f"""
-# ## How to list your model performance on this leaderboard:
-# Run the evaluation of your model using this repo: <a href="https://github.com/nlp-uoregon/mlmm-evaluation" target="_blank">https://github.com/nlp-uoregon/mlmm-evaluation</a>.
-# And then, push the evaluation log and make a pull request.
-# """
 # CREDIT = f"""
 # ## Credit

 INTRO_TEXT = f"""
 ## About
+This leaderboard tracks and ranks the reasoning performance of the leading, most advanced multilingual reasoning LLMs on three multilingual mathematical reasoning benchmarks. Each benchmark contains 10 languages: Bengali, Swedish, Thailand, Chinese, Japan, Russian, French, Spanish, German and English.
 Both multilingual and language-specific LLMs are welcome in this leaderboard.
+## Benchmarks
 - <a href="https://huggingface.co/datasets/Mathoctopus/MSVAMP" target="_blank">  MSVAMP </a>
 - <a href="https://huggingface.co/datasets/juletxara/mgsm" target="_blank">  MGSM </a>
+- <a href="https://huggingface.co/datasets/kevinpro/MNumGLUESub" target="_blank">  MNumGLUESub </a>
+"""
+HOW_TO = f"""
+## How to list your model performance on this leaderboard:
+Run the evaluation of your model using this repo: <a href="https://github.com/NJUNLP/MAPO" target="_blank">https://github.com/NJUNLP/MAPO</a>.
+And then, push the evaluation log and make a pull request.
+"""
 # CREDIT = f"""
 # ## Credit