mlfoundations-dev/multiple_samples_auto_verification_seed_code Viewer • Updated about 5 hours ago • 100 • 201
The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks Paper • 2502.08235 • Published 13 days ago • 53
mlfoundations-dev/math_stratos_scale_verified_with_hf Viewer • Updated about 1 month ago • 140k • 188