ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper β’ 2411.17465 β’ Published Nov 26, 2024 β’ 78
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper β’ 2412.04454 β’ Published Dec 5, 2024 β’ 59
CogAgent: A Visual Language Model for GUI Agents Paper β’ 2312.08914 β’ Published Dec 14, 2023 β’ 29
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper β’ 2412.19723 β’ Published 16 days ago β’ 78