Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model Paper • 2501.05122 • Published 3 days ago • 15 • 3
VidTwin: Video VAE with Decoupled Structure and Dynamics Paper • 2412.17726 • Published 20 days ago • 8 • 3
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper • 2501.02976 • Published 6 days ago • 46 • 3
An Empirical Study of Autoregressive Pre-training from Videos Paper • 2501.05453 • Published 3 days ago • 29 • 6
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 4 days ago • 190 • 25