LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Paper • 2412.15035 • Published 6 days ago
Revisiting In-Context Learning with Long Context Language Models Paper • 2412.16926 • Published 3 days ago
PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World Paper • 2412.17589 • Published 1 day ago
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published 2 days ago
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper • 2412.17498 • Published 2 days ago
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching Paper • 2412.17153 • Published 2 days ago
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning Paper • 2412.16849 • Published 3 days ago
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding Paper • 2412.17295 • Published 2 days ago
Agent-SafetyBench: Evaluating the Safety of LLM Agents Paper • 2412.14470 • Published 6 days ago
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Paper • 2412.14922 • Published 6 days ago
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published 2 days ago
NILE: Internal Consistency Alignment in Large Language Models Paper • 2412.16686 • Published 4 days ago
Outcome-Refining Process Supervision for Code Generation Paper • 2412.15118 • Published 6 days ago