Unifying Specialized Visual Encoders for Video Language Models Paper โข 2501.01426 โข Published 25 days ago โข 21
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems Paper โข 2407.01370 โข Published Jul 1, 2024 โข 86
Salesforce/xgen-mm-phi3-mini-instruct-r-v1 Image-Text-to-Text โข Updated Sep 18, 2024 โข 1.36k โข 186
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild Paper โข 2305.11147 โข Published May 18, 2023 โข 3