Introducing Visual Perception Token into Multimodal Large Language Model Paper • 2502.17425 • Published 7 days ago • 12 • 2
Attention Prompting on Image for Large Vision-Language Models Paper • 2409.17143 • Published Sep 25, 2024 • 7 • 2