MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 17 days ago • 271
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 642
HGRN2 Collection HGRN2: Gated Linear RNNs with State Expansion • 2 items • Updated Jun 25, 2024 • 2
Scaling Laws for Linear Complexity Language Models Paper • 2406.16690 • Published Jun 24, 2024 • 23
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models Paper • 2401.04658 • Published Jan 9, 2024 • 27
TransNormerLLM Collection TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer • 11 items • Updated Jun 25, 2024 • 3