DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 5 days ago • 203
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 5 days ago • 45
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published 10 days ago • 37
From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning Paper • 2411.03817 • Published Nov 6, 2024 • 1
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Paper • 2406.11896 • Published Jun 14, 2024 • 20