chrisliu298
committed on
Update README.md
README.md
CHANGED
@@ -46,9 +46,9 @@ We evaluate our model on [RewardBench](https://huggingface.co/spaces/allenai/rew
 
 | Rank | Model | Chat | Chat Hard | Safety | Reasoning | Score |
 | :---: | --------------------------- | :---: | :-------: | :----: | :-------: | :---: |
-| 1 | Skywork-Reward-Gemma-2-27B | 95.8 | 91.4 | 92.0 | 96.
+| 1 | Skywork-Reward-Gemma-2-27B | 95.8 | 91.4 | 92.0 | 96.1 | 93.8 |
 | 2 | SFR-LLaMa-3.1-70B-Judge-r | 96.9 | 84.8 | 92.2 | 97.6 | 92.8 |
-| 3 | Skywork-Reward-Llama-3.1-8B |
+| 3 | Skywork-Reward-Llama-3.1-8B | 95.8 | 87.3 | 90.6 | 96.2 | 92.5 |
 | 4 | Nemotron-4-340B-Reward | 95.8 | 87.1 | 92.2 | 93.6 | 92.2 |
 | 5 | ArmoRM-Llama3-8B-v0.1 | 96.9 | 76.8 | 92.2 | 97.3 | 90.8 |
 | 6 | internlm2-20b-reward | 98.9 | 76.5 | 89.9 | 95.8 | 90.3 |