microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ Updated 1 day ago β’ 231k β’ 1.03k
Running on Zero 423 423 Chat with DeepSeek-VL2-small π Generate responses using images and text input