An Empirical Study on Eliciting and Improving R1-like Reasoning Models Paper • 2503.04548 • Published 4 days ago • 5
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning Paper • 2503.05592 • Published 3 days ago • 12