R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning Paper • 2502.19735 • Published 2 days ago • 6
Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System Paper • 2502.16750 • Published 5 days ago • 7
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving Paper • 2502.20238 • Published 1 day ago • 20
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning Paper • 2502.19634 • Published 2 days ago • 44
UniTok: A Unified Tokenizer for Visual Generation and Understanding Paper • 2502.20321 • Published 1 day ago • 15
SoRFT: Issue Resolving with Subtask-oriented Reinforced Fine-Tuning Paper • 2502.20127 • Published 1 day ago • 7
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts Paper • 2502.20395 • Published 1 day ago • 33
CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale Paper • 2502.16645 • Published 6 days ago • 16
CritiQ: Mining Data Quality Criteria from Human Preferences Paper • 2502.19279 • Published 3 days ago • 6
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems Paper • 2502.19328 • Published 3 days ago • 17
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning? Paper • 2502.19361 • Published 3 days ago • 20
TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Paper • 2502.19400 • Published 2 days ago • 34
Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI Paper • 2502.17092 • Published 5 days ago • 3
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Paper • 2502.18364 • Published 4 days ago • 29
AAD-LLM: Neural Attention-Driven Auditory Scene Understanding Paper • 2502.16794 • Published 5 days ago • 4
Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models Paper • 2502.15499 • Published 8 days ago • 12