Yanggang Wang

esheep
·

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago
hkust-nlp/CodeIO-PyEdu-Reasoning
liked a dataset 2 days ago
KbsdJames/Omni-MATH
liked a model 3 days ago
Qwen/Qwen2.5-72B-Instruct
View all activity

Organizations

None yet

esheep's activity

upvoted an article 6 days ago
commented on Open R1: Update #2 7 days ago
view reply

How exactly is the Qwen/Qwen2.5-Math-RM-72B model used? Is it solely for ranking multiple answers? Can it also serve as a tool to validate whether the answers are correct?