Vaibhav Srivastav's picture

Vaibhav Srivastav PRO

reach-vb

·

https://vaibhavs10.github.io

AI & ML interests

TTS + LM performance prediction

Recent Activity

new activity about 5 hours ago

ai-starter-pack/README:huggingface pro subscription

upvoted an article 1 day ago

Remote VAEs for decoding with HF endpoints 🤗

liked a model 1 day ago

perplexity-ai/r1-1776-distill-llama-70b

View all activity

Organizations

reach-vb's activity

upvoted an article 1 day ago

Article

Remote VAEs for decoding with HF endpoints 🤗

2 days ago

• 25

upvoted a paper 4 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 5 days ago • 115

upvoted an article 4 days ago

Article

SigLIP 2: A better multilingual vision language encoder

5 days ago

• 90

upvoted a paper 5 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 6 days ago • 143

upvoted an article 5 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

6 days ago

• 161

upvoted a paper 5 days ago

Presumed Cultural Identity: How Names Shape LLM Responses

Paper • 2502.11995 • Published 8 days ago • 10

upvoted an article 6 days ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

7 days ago

• 58

upvoted a collection 6 days ago

PaliGemma 2 Mix

13 items • Updated 6 days ago • 59

upvoted 2 articles 7 days ago

Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

8 days ago

• 89

Article

Welcome Fireworks.ai on the Hub 🎆

12 days ago

• 53

upvoted a paper 12 days ago

Scaling Pre-training to One Hundred Billion Data for Vision Language Models

Paper • 2502.07617 • Published 14 days ago • 28

upvoted an article 13 days ago

Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

14 days ago

• 49

upvoted a collection 14 days ago

OLMoE (January 2025)

Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated 14 days ago • 9

upvoted an article 14 days ago

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

By

and 1 other •

14 days ago

• 25

upvoted an article 15 days ago

Article

Open R1: Update #2

By

and 6 others •

15 days ago

• 185

upvoted a paper 18 days ago

High-Fidelity Simultaneous Speech-To-Speech Translation

Paper • 2502.03382 • Published 20 days ago • 8

upvoted a collection 19 days ago

Hibiki fr-en

Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 19 days ago • 50

upvoted 2 papers 19 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 21 days ago • 192

Fully Autonomous AI Agents Should Not be Developed

Paper • 2502.02649 • Published 21 days ago • 24

upvoted an article 21 days ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

22 days ago

• 107