SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 5 days ago • 48
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper • 2502.09056 • Published 12 days ago • 30
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 14 days ago • 49
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 19 days ago • 50
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 23 days ago • 55
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 8 items • Updated 1 day ago • 367
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 107
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 149
InternVL2.5-MPO Collection Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 27 days ago • 26