Submitted by Hila 55 VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models · 8 authors 6
Submitted by chaoscodes 20 Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search · 10 authors 2
Submitted by DaYin 15 QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search · 7 authors 2
Submitted by Dominic789654 13 Can LLMs Maintain Fundamental Abilities under KV Cache Compression? · 9 authors 2
Submitted by codelion 11 Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial? · 4 authors 4
Submitted by dahyekim 11 Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations · 2 authors 2
Submitted by ShizhaoSun 9 Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models · 4 authors 2
Submitted by xdeng77 9 COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation · 8 authors 2
Submitted by nupurkmr9 6 Generating Multi-Image Synthetic Data for Text-to-Image Customization · 5 authors 2
Submitted by ericzhao28 5 Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification · 3 authors 2
Submitted by wenzhifang 3 Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models · 5 authors 3
Submitted by ttttonyhe 1 Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense · 10 authors 3