Running 1.88k 1.88k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks Paper • 2412.13053 • Published Dec 17, 2024
argilla/ultrafeedback-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 60.9k • 5.05k • 136
Running 222 222 AI2 WildBench Leaderboard (V2) 🦁 Display and explore model leaderboards and chat history