LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published 5 days ago • 135
X-Dancer: Expressive Music to Human Dance Video Generation Paper • 2502.17414 • Published 1 day ago • 8
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published 7 days ago • 42
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 5 days ago • 115
Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval Paper • 2502.13369 • Published 7 days ago • 2
InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning Paper • 2502.11573 • Published 9 days ago • 8
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation Paper • 2502.13128 • Published 7 days ago • 35
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation Paper • 2502.13143 • Published 7 days ago • 29
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Paper • 2502.13145 • Published 7 days ago • 35
Phantom: Subject-consistent video generation via cross-modal alignment Paper • 2502.11079 • Published 10 days ago • 50
Rethinking Diverse Human Preference Learning through Principal Component Analysis Paper • 2502.13131 • Published 7 days ago • 34
FoNE: Precise Single-Token Number Embeddings via Fourier Features Paper • 2502.09741 • Published 12 days ago • 11
ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Paper • 2502.09411 • Published 13 days ago • 17
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation Paper • 2502.08826 • Published 13 days ago • 16
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Paper • 2502.06608 • Published 16 days ago • 32
Retrieval-augmented Large Language Models for Financial Time Series Forecasting Paper • 2502.05878 • Published 17 days ago • 38