SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 14 days ago • 127
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 16 days ago • 61
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 831
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Dec 13, 2024 • 145