Submitted by akhaliq 78 Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach · 9 authors 11
Submitted by myownskyW7 60 VideoRoPE: What Makes for Good Video Rotary Position Embedding? · 12 authors 2
Submitted by d-alistarh 38 QuEST: Stable Training of LLMs with 1-Bit Weights and Activations · 6 authors 3
Submitted by yulunliu 26 AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting · 11 authors 3
Submitted by akhaliq 20 FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation · 10 authors 3
Submitted by ydeng9 19 DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails · 5 authors 2
Submitted by akhaliq 18 Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models · 8 authors 2
Submitted by akhaliq 16 Generating Symbolic World Models via Test-time Scaling of Large Language Models · 8 authors 2
Submitted by danielm1405 10 No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces · 6 authors 2
Submitted by Eleven-P 10 CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference · 8 authors 2
Submitted by zhaoyue-zephyrus 10 QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation · 9 authors 2
Submitted by akhaliq 10 On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices · 6 authors 3
Submitted by akhaliq 10 CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance · 5 authors 3
Submitted by nielsr 9 Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More · 7 authors 2
Submitted by akhaliq 9 Linear Correlation in LM's Compositional Generalization and Hallucination · 5 authors 3
Submitted by rohitsaxena 7 Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs · 3 authors 4
Submitted by yuweiyin 7 ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning · 2 authors 3
Submitted by amanchadha 5 YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment · 8 authors 2
Submitted by XiaotingQin 3 MEETING DELEGATE: Benchmarking LLMs on Attending Meetings on Our Behalf · 8 authors 3
Submitted by sinatayebati 2 SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs · 5 authors 2
Submitted by sinatayebati - Intelligent Sensing-to-Action for Robust Autonomy at the Edge: Opportunities and Challenges · 12 authors 2