Spaces:

llm-jp
/

open-japanese-llm-leaderboard

Running on CPU Upgrade

AkimfromParis commited on Aug 13, 2024

Commit

0cc7ac6

verified ·

1 Parent(s): 067f637

Test font and layout for datasets in About v1.0

Files changed (1) hide show

src/about.py CHANGED Viewed

@@ -91,20 +91,16 @@ LLM_BENCHMARKS_TEXT = f"""
 📈 We evaluate Japanese Large Language Models on 52 key benchmarks leveraging our evaluation tool [llm-jp-eval](https://github.com/llm-jp/llm-jp-eval), a unified framework to evaluate Japanese LLMs on various evaluation tasks.
 Benchmarks:
-NLI (Natural Language Inference)
----
-`Jamp`
-Source：https://github.com/tomo-ut/temporalNLI_dataset
-License：CC BY-SA 4.0
-###JaNLI
 Source：https://github.com/verypluming/JaNLI
 License：CC BY-SA 4.0
-###JNLI
 Source：https://github.com/yahoojapan/JGLUE
 License：CC BY-SA 4.0

 📈 We evaluate Japanese Large Language Models on 52 key benchmarks leveraging our evaluation tool [llm-jp-eval](https://github.com/llm-jp/llm-jp-eval), a unified framework to evaluate Japanese LLMs on various evaluation tasks.
 Benchmarks:
+**NLI (Natural Language Inference)**
+- `Jamp`  JAMP, a Japanese NLI benchmark focused on temporal inference [Source](https://github.com/tomo-ut/temporalNLI_dataset) | License CC BY-SA 4.0
+### JaNLI
 Source：https://github.com/verypluming/JaNLI
 License：CC BY-SA 4.0
+#### JNLI
 Source：https://github.com/yahoojapan/JGLUE
 License：CC BY-SA 4.0