SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Paper • 2501.18427 • Published Jan 30 • 17
Running on CPU Upgrade 12.7k 12.7k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published Nov 25, 2024 • 19