Improved Training Technique for Latent Consistency Models Paper • 2502.01441 • Published 22 days ago • 8
SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model Paper • 2501.18636 • Published 28 days ago • 28
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published 22 days ago • 38
Can LLMs Maintain Fundamental Abilities under KV Cache Compression? Paper • 2502.01941 • Published 22 days ago • 14
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search Paper • 2502.02508 • Published 21 days ago • 22
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models Paper • 2502.04404 • Published 20 days ago • 22
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 18 days ago • 117
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 15 days ago • 136
Expect the Unexpected: FailSafe Long Context QA for Finance Paper • 2502.06329 • Published 15 days ago • 124
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published 14 days ago • 44
CoT-Valve: Length-Compressible Chain-of-Thought Tuning Paper • 2502.09601 • Published 12 days ago • 14
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published 12 days ago • 27
ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Paper • 2502.09411 • Published 12 days ago • 17
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training Paper • 2502.11196 • Published 9 days ago • 21