Merve Noyan

merve

https://github.com/merveenoyan/smol-vision

AI & ML interests

VLMs, vision & co

Recent Activity

liked a dataset 12 minutes ago

allenai/olmOCR-mix-0225

upvoted a collection 12 minutes ago

liked a model 12 minutes ago

allenai/olmOCR-7B-0225-preview

Posts 98

Post

4912

Google just released PaliGemma 2 Mix: new versatile instruction vision language models 🔥

> Three new model sizes: 3B, 10B, and 28B, at resolutions 224 and 448 💙
> Can do vision language tasks with open-ended prompts, understand documents, and segment or detect anything 🤯

Read more: https://huggingface.co/blog/paligemma2mix
Try the demo: google/paligemma2-10b-mix
All models are here: google/paligemma-2-mix-67ac6a251aaf3ee73679dcc4

Articles 23

Article

90

SigLIP 2: A better multilingual vision language encoder

Collections 44

Spaces 104

Vision Papers

All paper summaries read by Merve

Running on Zero

Paligemma2 Vqav2

PaliGemma2 LoRA finetuned on VQAv2

Running on Zero

OWLSAM

State-of-the-art open-vocabulary image segmentation ⚡️

Sam2.1

SuperPoint

Identify key points in an image

Running on CPU Upgrade

Gradio Tgi

Models 92

merve/SmolVLM2-500M-Video-Instruct-video-feedback

Image-Text-to-Text • Updated 5 days ago • 9

merve/SmolVLM2-500M-Video-Instruct-videofeedback

Image-Text-to-Text • Updated 5 days ago • 9

merve/SmolVLM2-500M-Video-Instruct-emotions

Image-Text-to-Text • Updated 5 days ago • 13

merve/colpali_ufo

Updated Dec 20, 2024 • 2

merve/paligemma_vqav2

Image-Text-to-Text • Updated Dec 18, 2024 • 220 • 13

merve/paligemma2-3b-vqav2

Updated Dec 5, 2024 • 1.45k • 6

merve/google-ckpts

Updated Oct 22, 2024

merve/google-tokenizers

Updated Oct 22, 2024

merve/idefics3-llama-vqav2

Updated Sep 11, 2024

merve/idefics3llama-vqav2

Updated Sep 11, 2024 • 8

Datasets 26

merve/model-test-inputs

Updated Oct 21, 2024 • 44

merve/vqav2-small

Viewer • Updated Aug 8, 2024 • 21.4k • 1.73k • 9

merve/SGinW

Viewer • Updated Jul 11, 2024 • 16.7k • 3.41k • 1

merve/pascal-voc

Viewer • Updated Jul 6, 2024 • 336k • 1.29k

merve/YouCook2

Viewer • Updated May 28, 2024 • 2k • 99

merve/faiss_embeddings

Updated Jan 25, 2024 • 16

merve/pokemon-ds-embeddings

Viewer • Updated Jan 10, 2024 • 833 • 74 • 4

merve/tr-h4-norobots

Updated Jan 7, 2024 • 108 • 10

merve/lego_sets_latest

Viewer • Updated Jan 6, 2024 • 61 • 522 • 4

merve/ai-tube-dummy

Updated Dec 1, 2023 • 44