Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
lmms-lab
's Collections
VideoMMMU
Multimodal-SAE
LLaVA-Critic
LLaVA-Video
LLaVA-OneVision
LMMs-Eval
LongVA
LLaVA-Next-Interleave
LLaVA-NeXT
LMMs-Eval-Lite
LLaVA-OneVision
updated
Oct 5, 2024
a model good at arbitrary types of visual input
Upvote
22
+12
LLaVA-OneVision: Easy Visual Task Transfer
Paper
•
2408.03326
•
Published
Aug 6, 2024
•
60
lmms-lab/LLaVA-OneVision-Mid-Data
Viewer
•
Updated
Aug 26, 2024
•
563k
•
666
•
19
lmms-lab/LLaVA-OneVision-Data
Viewer
•
Updated
Oct 22, 2024
•
3.72M
•
26.3k
•
167
lmms-lab/LLaVA-NeXT-Data
Viewer
•
Updated
Aug 30, 2024
•
779k
•
1.94k
•
29
lmms-lab/llavanext-qwen-siglip-tokenizer
Text Generation
•
Updated
Jul 11, 2024
•
24
•
3
lmms-lab/llava-onevision-qwen2-0.5b-si
Text Generation
•
Updated
Sep 2, 2024
•
8.63k
•
13
lmms-lab/llava-onevision-qwen2-0.5b-ov
Text Generation
•
Updated
Sep 2, 2024
•
41.8k
•
17
lmms-lab/llava-onevision-qwen2-7b-si
Text Generation
•
Updated
Sep 2, 2024
•
7.94k
•
12
lmms-lab/llava-onevision-qwen2-7b-ov
Text Generation
•
Updated
Sep 2, 2024
•
95.2k
•
45
lmms-lab/llava-onevision-qwen2-72b-si
Text Generation
•
Updated
Sep 2, 2024
•
593
•
1
lmms-lab/llava-onevision-qwen2-72b-ov-sft
Text Generation
•
Updated
Sep 2, 2024
•
3.32k
•
14
lmms-lab/llava-onevision-qwen2-72b-ov-chat
Image-Text-to-Text
•
Updated
Oct 9, 2024
•
1.35k
•
8
lmms-lab/llava-onevision-projectors
Updated
Aug 14, 2024
•
3
lmms-lab/llava-onevision-qwen2-0.5b-mid-stage-a4
Updated
Aug 6, 2024
•
246
lmms-lab/llava-onevision-qwen2-7b-mid-stage-a4
Updated
Aug 6, 2024
•
115
Upvote
22
+18
Share collection
View history
Collection guide
Browse collections