deepseek-ai/DeepSeek-R1-Distill-Llama-8B Text Generation • Updated 7 days ago • 1.38M • • 618
Running 1.85k 1.85k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
mistralai/Mistral-Small-24B-Instruct-2501 Text Generation • Updated 28 days ago • 775k • • 835