One Vision-Language-Action Model for GUI Agent
Qinghong (Kevin) Lin PRO
KevinQHLin
AI & ML interests
Vision-Language Model, Video Understanding, Human-AI Interaction
Recent Activity
updated
a dataset
about 5 hours ago
KevinQHLin/azcopy
published
a dataset
about 5 hours ago
KevinQHLin/azcopy
upvoted
a
paper
1 day ago
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data