arxiv:2501.12948
Shirong Ma
msr2000
AI & ML interests: None yet
Recent Activity
authored a paper (1 day ago)
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
updated a model (1 day ago)
deepseek-ai/DeepSeek-R1-Zero
updated a model (1 day ago)
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B