Image-to-text models
Collection of image captioning models

Salesforce/blip-image-captioning-large • Image-to-Text • Updated about 1 month ago • 1.48M • 1.29k
microsoft/git-large-coco • Image-to-Text • Updated Jun 26, 2023 • 21.9k • 103
Salesforce/instructblip-vicuna-7b • Image-Text-to-Text • Updated about 1 month ago • 261k • 89
Salesforce/blip2-flan-t5-xxl • Image-Text-to-Text • Updated about 1 month ago • 7.88k • 86

SigLIP release
SigLIP improves upon CLIP with a sigmoid loss. Both English-only and multilingual checkpoints are released.

Sigmoid Loss for Language Image Pre-Training • Paper • 2303.15343 • Published Mar 27, 2023 • 8
google/siglip-base-patch16-224 • Zero-Shot Image Classification • Updated Sep 26, 2024 • 340k • 37
google/siglip-base-patch16-256 • Zero-Shot Image Classification • Updated Sep 26, 2024 • 4.32k • 5
google/siglip-base-patch16-384 • Zero-Shot Image Classification • Updated Sep 26, 2024 • 2.85k • 10
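
The sigmoid loss mentioned in the SigLIP description can be illustrated with a small sketch. The idea, as described in the linked paper, is to score every image-text pair in a batch independently as a binary match/no-match decision, instead of normalizing over the whole batch with a softmax as CLIP does. The code below is a minimal pure-Python illustration, not the released training code: the function names are hypothetical, and the default `temperature` and `bias` values are assumptions chosen for the example.

```python
import math


def log_sigmoid(x):
    # Numerically stable log(sigmoid(x)).
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def l2_normalize(vector):
    # Scale a vector to unit length so dot products are cosine similarities.
    norm = math.sqrt(sum(c * c for c in vector))
    return [c / norm for c in vector]


def sigmoid_pairwise_loss(image_embs, text_embs, temperature=10.0, bias=-10.0):
    # Hypothetical helper: temperature and bias defaults are assumptions.
    # image_embs[i] and text_embs[i] are assumed to be a matching pair.
    images = [l2_normalize(v) for v in image_embs]
    texts = [l2_normalize(v) for v in text_embs]
    total = 0.0
    for i, image in enumerate(images):
        for j, text in enumerate(texts):
            similarity = sum(a * b for a, b in zip(image, text))
            logit = temperature * similarity + bias
            # +1 for the matching pair, -1 for every other pair in the batch.
            label = 1.0 if i == j else -1.0
            total += log_sigmoid(label * logit)
    # Average the summed pairwise terms over the number of images.
    return -total / len(images)


# Usage: correctly paired embeddings should score a lower loss than swapped ones.
aligned = sigmoid_pairwise_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
swapped = sigmoid_pairwise_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
print(aligned < swapped)
```

Because each pair contributes its own term, the loss needs no batch-wide normalization, which is the main contrast with a softmax-based contrastive loss.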