MiniMax-01: Scaling Foundation Models with Lightning Attention Paper โข 2501.08313 โข Published 12 days ago โข 268
YuLan-Mini: An Open Data-efficient Language Model Paper โข 2412.17743 โข Published Dec 23, 2024 โข 64
Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting Paper โข 2412.00869 โข Published Dec 1, 2024 โข 4
A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models Paper โข 2411.19477 โข Published Nov 29, 2024 โข 6