SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator Paper • 2412.12094 • Published Dec 16, 2024 • 10
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models Paper • 2306.07691 • Published Jun 13, 2023 • 6
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform Paper • 2203.02395 • Published Mar 4, 2022
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 24 days ago • 273
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 29 days ago • 91
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper • 2501.06282 • Published 28 days ago • 43
An Empirical Study of Autoregressive Pre-training from Videos Paper • 2501.05453 • Published 29 days ago • 37
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 25 days ago • 89
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published 22 days ago • 67
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published 22 days ago • 36
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published 16 days ago • 79
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 10 days ago • 100
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Paper • 2501.18427 • Published 8 days ago • 16
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published 4 days ago • 34
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning Paper • 2502.03275 • Published 2 days ago • 9
LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer Paper • 2502.01105 • Published 4 days ago • 11