Deepseek Papers Collection • 15 items • Updated about 9 hours ago • 39
Tulu 3 Models Collection All models released with Tulu 3 -- state-of-the-art open post-training recipes. • 10 items • Updated 6 days ago • 85
mistralai/Mistral-Small-24B-Instruct-2501 Text Generation • Updated 2 days ago • 18.4k • 564
TinySwallow Collection Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items • Updated 6 days ago • 12
YuE Collection YuE is a groundbreaking series of open-source foundation models designed for music generation, led by HKUST. • 7 items • Updated 6 days ago • 2