Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
Test font and layout for datasets in About v1.0
Browse files- src/about.py +4 -8
src/about.py
CHANGED
@@ -91,20 +91,16 @@ LLM_BENCHMARKS_TEXT = f"""
|
|
91 |
📈 We evaluate Japanese Large Language Models on 52 key benchmarks leveraging our evaluation tool [llm-jp-eval](https://github.com/llm-jp/llm-jp-eval), a unified framework to evaluate Japanese LLMs on various evaluation tasks.
|
92 |
|
93 |
Benchmarks:
|
94 |
-
NLI (Natural Language Inference)
|
95 |
-
---
|
96 |
|
97 |
-
`Jamp`
|
98 |
|
99 |
-
|
100 |
-
License:CC BY-SA 4.0
|
101 |
-
|
102 |
-
###JaNLI
|
103 |
|
104 |
Source:https://github.com/verypluming/JaNLI
|
105 |
License:CC BY-SA 4.0
|
106 |
|
107 |
-
|
108 |
|
109 |
Source:https://github.com/yahoojapan/JGLUE
|
110 |
License:CC BY-SA 4.0
|
|
|
91 |
📈 We evaluate Japanese Large Language Models on 52 key benchmarks leveraging our evaluation tool [llm-jp-eval](https://github.com/llm-jp/llm-jp-eval), a unified framework to evaluate Japanese LLMs on various evaluation tasks.
|
92 |
|
93 |
Benchmarks:
|
94 |
+
**NLI (Natural Language Inference)**
|
|
|
95 |
|
96 |
+
- `Jamp` JAMP, a Japanese NLI benchmark focused on temporal inference [Source](https://github.com/tomo-ut/temporalNLI_dataset) | License CC BY-SA 4.0
|
97 |
|
98 |
+
### JaNLI
|
|
|
|
|
|
|
99 |
|
100 |
Source:https://github.com/verypluming/JaNLI
|
101 |
License:CC BY-SA 4.0
|
102 |
|
103 |
+
#### JNLI
|
104 |
|
105 |
Source:https://github.com/yahoojapan/JGLUE
|
106 |
License:CC BY-SA 4.0
|