Hugging Face
Models: 468
Active filters: multimodal
Sort: Trending
lmstudio-community/UI-TARS-2B-SFT-GGUF • Image-Text-to-Text • Updated 3 days ago • 290 • 1
lmstudio-community/UI-TARS-72B-DPO-GGUF • Image-Text-to-Text • Updated 3 days ago • 188 • 1
vincentamato/ARIA • Updated 1 day ago • 1
Sci-fi-vy/Llama-3.2-11B-Vision-Instruct-finetuned • Image-Text-to-Text • Updated about 7 hours ago • 1
sujitpal/clip-imageclef • Zero-Shot Image Classification • Updated Oct 31, 2023 • 16 • 3
waybarrios/guidance-based-video-grounding • Updated Apr 1, 2023
MonoHime/mosei-senti-intermodal • Feature Extraction • Updated May 18, 2023 • 4
MonoHime/mosei-emo-intermodal • Feature Extraction • Updated May 18, 2023 • 3
MonoHime/iemocap-emo-intermodal • Feature Extraction • Updated May 18, 2023 • 4
MonoHime/mosi-senti-intermodal • Feature Extraction • Updated May 18, 2023 • 3
MonoHime/meld-emo-intermodal • Feature Extraction • Updated May 18, 2023 • 3
HuggingFaceM4/idefics-9b • Text Generation • Updated Oct 12, 2023 • 5.31k • 47
HuggingFaceM4/idefics-9b-instruct • Text Generation • Updated Oct 12, 2023 • 23.9k • 104
HuggingFaceM4/idefics-80b-instruct • Text Generation • Updated Oct 12, 2023 • 1.58k • 181
typeof/idefics-9b • Text Generation • Updated Oct 13, 2023 • 8
sshh12/Mistral-7B-LoRA-VisionCLIP-LLAVA • Text Generation • Updated Oct 28, 2023 • 12 • 9
sshh12/Mistral-7B-LoRA-ImageBind-LLAVA • Text Generation • Updated Nov 2, 2023 • 17 • 11
sshh12/Mistral-7B-LoRA-DocumentGTE-260K-x128 • Text Generation • Updated Nov 4, 2023 • 16 • 3
emma-heriot-watt/models • Updated Dec 5, 2023 • 2
xun/Qwen-Audio-Chat-Int4 • Text Generation • Updated Dec 2, 2023 • 26 • 4
PsiPi/NousResearch_Nous-Hermes-2-Vision-GGUF • Image-Text-to-Text • Updated Mar 11, 2024 • 882 • 15
sshh12/Mistral-7B-LoRA-VisionCLIPPool-LLAVA • Image-Text-to-Text • Updated Mar 8, 2024 • 12 • 1
sshh12/Mistral-7B-LoRA-AudioWhisper • Updated Dec 13, 2023 • 8 • 2
sshh12/Mistral-7B-LoRA-AudioCLAP • Updated Dec 13, 2023 • 11 • 5
sshh12/Mistral-7B-LoRA-Multi-VisionCLIPPool-LLAVA • Image-Text-to-Text • Updated Mar 27, 2024 • 13 • 2
sshh12/Mistral-7B-LoRA-XCLIP • Updated Mar 27, 2024 • 11 • 1
Infi-MM/infimm-zephyr • Text Generation • Updated Mar 6, 2024 • 8 • 10
Infi-MM/infimm-vicuna13b • Text Generation • Updated Mar 6, 2024 • 10 • 3
Omega02gdfdd/Omega-bioclip • Zero-Shot Image Classification • Updated Jan 7, 2024
marcosv/InstructIR • Image-to-Image • Updated Jan 31, 2024 • 29