A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations Paper • 2502.14881 • Published 12 days ago • 1
VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues Paper • 2502.12084 • Published 8 days ago • 29
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published 12 days ago • 27