Update README.md
Browse files
README.md
CHANGED
@@ -15,6 +15,13 @@ Qwerus-7B is a merge of the following models using [LazyMergekit](https://colab.
|
|
15 |
* [PRIME-RL/Eurus-2-7B-PRIME](https://huggingface.co/PRIME-RL/Eurus-2-7B-PRIME)
|
16 |
* [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
|
17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
## 🧩 Configuration
|
19 |
|
20 |
```yaml
|
|
|
15 |
* [PRIME-RL/Eurus-2-7B-PRIME](https://huggingface.co/PRIME-RL/Eurus-2-7B-PRIME)
|
16 |
* [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
|
17 |
|
18 |
+
Benchmark on reasoning tasks using lighteval:
|
19 |
+
|
20 |
+
| Task |Version| Metric |Value | |Stderr|
|
21 |
+
|-----------------|------:|----------------|-----:|---|-----:|
|
22 |
+
|aime24 | 1|extractive_match|0.1333|± |0.0631|
|
23 |
+
|math_500| 1|extractive_match|0.7420|± |0.0196|
|
24 |
+
|
25 |
## 🧩 Configuration
|
26 |
|
27 |
```yaml
|