InternViT-6B + QLLaMA, can be used for image-text retrieval like CLIP
3
#5 opened about 1 month ago
by
vitvit
Fix incorrect image embedding when running with a single GPU and 24GB VRAM
1
#3 opened 8 months ago
by
xdedss
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64c8c29d1825d12864c7b83d/_umr_Xd_eS0koLsxuBuL5.jpeg)