The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13 • 92
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 Paper • 2502.03544 • Published 29 days ago • 43
FoNE: Precise Single-Token Number Embeddings via Fourier Features Paper • 2502.09741 • Published 21 days ago • 11
SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers Paper • 2502.20545 • Published 7 days ago • 19