Qwen2-VL models with Visual Perception Tokens, along with models used in the training process.
Runpeng Yu
rp-yu
Recent Activity
Authored a paper: Introducing Visual Perception Token into Multimodal Large Language Model
Collections: 1
Models: 7
rp-yu/Qwen2-VL-2b-VPT-Det-NoPrompt (Image-Text-to-Text)
rp-yu/Qwen2-VL-7b-VPT-CLIP (Image-Text-to-Text)
rp-yu/Qwen2-VL-2b-VPT-Det (Image-Text-to-Text)
rp-yu/Qwen2-VL-2b-VPT-Seg-Alignment (Image-Text-to-Text)
rp-yu/Qwen2-VL-2b-VPT-CLIP (Image-Text-to-Text)
rp-yu/Qwen2-VL-2b-VPT-Seg (Image-Text-to-Text)
rp-yu/Qwen2-VL-2b-VPT-Det-Alignment (Image-Text-to-Text)