Hanbin Wang

hanbin

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

Articles

Organizations

OpenBMB's profile picture PRIME's profile picture

hanbin's activity

New activity in PRIME-RL/Eurus-2-7B-PRIME 26 days ago

Evaluation

6
#1 opened 27 days ago by
tugstugi
upvoted an article 28 days ago
view article
Article

Process Reinforcement through Implicit Rewards

By ganqu
20