Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 8 items • Updated 3 days ago • 371
🎭 Avatars • Collection • The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 71 items • Updated 7 days ago • 81
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency • Paper • 2409.02634 • Published Sep 4, 2024 • 94
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs • Paper • 2404.05719 • Published Apr 8, 2024 • 82