MiniMax-01: Scaling Foundation Models with Lightning Attention Paper β’ 2501.08313 β’ Published 12 days ago β’ 268
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper β’ 2501.02976 β’ Published 20 days ago β’ 52
TransPixar: Advancing Text-to-Video Generation with Transparency Paper β’ 2501.03006 β’ Published 20 days ago β’ 22
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring Paper β’ 2501.02045 β’ Published 23 days ago β’ 21
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video Paper β’ 2411.18671 β’ Published Nov 27, 2024 β’ 20
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper β’ 2411.04905 β’ Published Nov 7, 2024 β’ 113
MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines Paper β’ 2409.12959 β’ Published Sep 19, 2024 β’ 37