VITA-MLLM/VITA-1.5
Video-Text-to-Text
Safetensors
vita-Qwen2
arxiv: 2501.01957
main
VITA-1.5/audio-encoder-Qwen2-7B-1107-weight-base-11wh-tunning/train.yaml
shenyunhang
add all
077821d
about 2 months ago
Safe
364 kB
File too large to display; check the raw version instead.