Collections
Collections including paper arxiv:2311.03079
- GLaMM: Pixel Grounding Large Multimodal Model
  Paper • 2311.03356 • Published • 33
- CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
  Paper • 2311.03354 • Published • 4
- CogVLM: Visual Expert for Pretrained Language Models
  Paper • 2311.03079 • Published • 23
- UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework
  Paper • 2311.10125 • Published • 4