MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens Paper • 2404.03413 • Published Apr 4, 2024 • 27
openai/clip-vit-large-patch14-336 Zero-Shot Image Classification • Updated Oct 4, 2022 • 5.11M • • 228