Lucas Beyer

giffmana

AI & ML interests

None yet

Recent Activity

Organizations

gg-hf's profile picture big_vision's profile picture

giffmana's activity

New activity in google/siglip-so400m-patch14-384 33 minutes ago
view reply

Sorry i can't collaborate with individual's papers.

view reply

Ah sorry that wasn't clear from your message. I'm not familiar enough with this codebase to help more.

view reply

The warning gives you the answer: pass max_length=64

view reply

Yes. If you want longer text, what I'd do is chunk it into pieces of 64 tokens (possibly even overlapping), embed those separately, and either average their endings or dot them with the image embedding individually and take max or average score, depending on your use case.

I'm actually curious what kind of queries you're dealing with that are longer than 64 tokens? All use cases of siglip i can think of almost always fit in way below 64.

New activity in google/siglip-large-patch16-384 5 months ago

upload fast tokenizer.json

#1 opened 6 months ago by
itazap
New activity in google/siglip-large-patch16-256 5 months ago

Upload tokenizer.json

#2 opened 6 months ago by
itazap
New activity in google/siglip-base-patch16-256 5 months ago

Upload tokenizer.json

#1 opened 6 months ago by
itazap
New activity in google/siglip-base-patch16-384 5 months ago

Upload tokenizer.json

#1 opened 6 months ago by
itazap
New activity in google/siglip-base-patch16-512 5 months ago

upload fast tokenizer.json

#2 opened 6 months ago by
itazap