TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Paper • 2410.10818 • Published Oct 14, 2024 • 15
VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation Paper • 2407.10972 • Published Jul 15, 2024 • 1
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model Paper • 2405.02363 • Published May 3, 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want Paper • 2403.20271 • Published Mar 29, 2024 • 3
Split & Merge: Unlocking the Potential of Visual Adapters via Sparse Training Paper • 2312.02923 • Published Dec 5, 2023 • 1