Beyond Release: Access Considerations for Generative AI Systems Paper • 2502.16701 • Published 2 days ago • 8
Forecasting Open-Weight AI Model Growth on Hugging Face Paper • 2502.15987 • Published 4 days ago • 7
MoBA: Mixture of Block Attention for Long-Context LLMs Paper • 2502.13189 • Published 7 days ago • 12
Presumed Cultural Identity: How Names Shape LLM Responses Paper • 2502.11995 • Published 8 days ago • 10
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction Paper • 2502.11663 • Published 8 days ago • 36
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 5 days ago • 91
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published 15 days ago • 59
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities? Paper • 2502.12215 • Published 8 days ago • 15
Soundwave: Less is More for Speech-Text Alignment in LLMs Paper • 2502.12900 • Published 7 days ago • 75
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published 9 days ago • 134
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 8 days ago • 28
Lumina Family Collection Lumina-T2X is a unified framework for Text to Any Modality Generation • 8 items • Updated Jul 30, 2024 • 6
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 8 items • Updated 8 days ago • 51