ychenNLP committed (verified)
Commit 2a7b810 · 1 Parent(s): 1e072c9

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -53,7 +53,7 @@ We compare AceMath to leading proprietary and open-access math models in above T
 | AceMath-7B-RM (Ours) | 96.66 | 85.47 | 41.96 | 73.82 | 46.81 | 46.37 | 80.78 | 67.41 |
 | AceMath-72B-RM (Ours) | 97.23 | 86.72 | 45.06 | 74.69 | 49.23 | 46.79 | 87.01 | 69.53 |

- *Reward model evaluation on AceMath-RewardBench. The average results (rm@8) of reward models on math benchmarks, randomly sample 8 responses from 64 candidates with 100 random seeds. Response candidates are generated from a pool of 8 LLMs.
+ *Reward model evaluation on [AceMath-RewardBench](https://huggingface.co/datasets/nvidia/AceMath-RewardBench). The average results (rm@8) of reward models on math benchmarks, randomly sample 8 responses from 64 candidates with 100 random seeds. Response candidates are generated from a pool of 8 LLMs.

 ## How to use
 ```python
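
For context on the rm@8 metric in the edited caption: it is an average best-of-8 accuracy, where 8 responses are drawn from 64 candidates per question and the reward model's top pick is checked for correctness, averaged over 100 random seeds. Below is a minimal sketch of that protocol, assuming per-question reward scores and 0/1 correctness labels for all 64 candidates are already available; the function name `rm_at_8` and its arguments are illustrative and not part of the AceMath release.

```python
import numpy as np

def rm_at_8(scores, correct, n_draw=8, n_seeds=100, seed=0):
    """Average best-of-n accuracy over repeated random draws.

    scores:  (n_questions, n_candidates) reward-model scores.
    correct: (n_questions, n_candidates) 0/1 correctness labels.
    """
    rng = np.random.default_rng(seed)
    n_questions, n_candidates = scores.shape
    accs = []
    for _ in range(n_seeds):
        hits = 0
        for q in range(n_questions):
            # Draw n_draw of the candidate responses without replacement,
            # keep the one the reward model scores highest, and check it.
            idx = rng.choice(n_candidates, size=n_draw, replace=False)
            best = idx[np.argmax(scores[q, idx])]
            hits += correct[q, best]
        accs.append(hits / n_questions)
    # Report the mean accuracy across the random seeds.
    return float(np.mean(accs))
```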