deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated about 2 hours ago • 98.1k • • 496
facebook/roberta-hate-speech-dynabench-r4-target Text Classification • Updated Mar 16, 2023 • 1.48M • 70
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 18 days ago • 248
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published 22 days ago • 87