SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 5 days ago • 115
Presumed Cultural Identity: How Names Shape LLM Responses Paper • 2502.11995 • Published 8 days ago • 10
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 7 days ago • 58
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 8 days ago • 89
Scaling Pre-training to One Hundred Billion Data for Vision Language Models Paper • 2502.07617 • Published 14 days ago • 28
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 14 days ago • 49
OLMoE (January 2025) Collection Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated 14 days ago • 9
view article Article From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages By Steveeeeeeen and 1 other • 14 days ago • 25
High-Fidelity Simultaneous Speech-To-Speech Translation Paper • 2502.03382 • Published 20 days ago • 8
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 19 days ago • 50
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 21 days ago • 192
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 22 days ago • 107