Running 1.62k 1.62k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais and 2 others • Nov 13, 2024 • 99
view article Article MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR By abhinand • Oct 20, 2024 • 34
Snowflake/snowflake-arctic-embed-m-long Sentence Similarity • Updated Dec 13, 2024 • 28.2k • 34