view article Article ✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use By Ziyang and 1 other • Jan 3 • 13
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models Paper • 2501.02955 • Published Jan 6 • 40
VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-Tuning Paper • 2406.14056 • Published Jun 20, 2024