openbmb/Eurux-8x22b-nca

lievan committed on Apr 15, 2024

Commit 661d7f9 · verified · 1 Parent(s): 1a68c57

Update README.md

Files changed (1)

README.md +1 -1


@@ -43,7 +43,7 @@ It achieves superb reasoning performance as well as exellent chat & instruction-
 ## Evaluation
 We conducted overall coding, math, reasoning, knowledge, instruction-following and chat benchmarking. Results are shown below:
-| Models / Tasks  |   Coding  |       |          |   Math  |       |           | Reasoning | Knowledge | Ins-Following |   Chat   |
+| Models&Tasks    |   Coding  |       |          |   Math  |       |           | Reasoning | Knowledge | Ins-Following |   Chat   |
 |-----------------|:---------:|:-----:|:--------:|:-------:|:-----:|:---------:|:---------:|:---------:|:-------------:|:--------:|
 | Datasets        | HumanEval |  MBPP | LeetCode | GSMPLUS |  MATH | TheoremQA | BBH (CoT) |    MMLU   |     IFEval    | MT-Bench |
 | GPT-3.5-Turbo   |   76.8    | 82.5  |   23.3   |  61.2   | 37.8  |   35.6    |   70.1    |   70.0    |     56.6      |   7.94   |