https://aguvis-project.github.io
Yiheng Xu PRO
ranpox
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
Diving into Self-Evolving Training for Multimodal Reasoning
upvoted
a
collection
5 days ago
AGUVIS: Unified Pure Vision GUI Agents
updated
a collection
5 days ago
AGUVIS: Unified Pure Vision GUI Agents
Organizations
Collections
3
-
LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Paper • 1912.13318 • Published • 2 -
microsoft/layoutlm-base-uncased
Updated • 2.24M • 47 -
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Paper • 2012.14740 • Published • 1 -
microsoft/layoutlmv2-base-uncased
Updated • 462k • 61
spaces
1
models
None public yet