microsoft/Phi-4-multimodal-instruct
Tags: Automatic Speech Recognition, Transformers, Safetensors, 24 languages, phi4mm, text-generation, nlp, code, audio, speech-summarization, speech-translation, visual-question-answering, phi-4-multimodal, phi, phi-4-mini, custom_code
arxiv: 2407.13833
License: mit
refs/pr/8
Phi-4-multimodal-instruct/examples/what_is_the_traffic_sign_in_the_image.wav
nguyenbh: Add examples (bd4b39b), 12 days ago
741 kB
This file contains binary data. It cannot be displayed, but you can still download it.