arxiv:2501.12948
Junxiao Song
haha-point
AI & ML interests
None yet
Recent Activity
authored a paper 3 days ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
authored a paper 5 months ago
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Organizations
None yet
models
None public yet
datasets
None public yet