Vision Models (GGUF) Collection How to use: Download a "mmproj" model file + one or more of the primary model files. โข 5 items โข Updated Dec 22, 2023 โข 42
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models Paper โข 2404.13013 โข Published Apr 19 โข 30
Multimodal Foundation Models: From Specialists to General-Purpose Assistants Paper โข 2309.10020 โข Published Sep 18, 2023 โข 40