7 562 139

蓋瑞王

gary109

AI & ML interests

GAN,Music

Recent Activity

liked a model 3 days ago

voidful/Llama-3.1-TAIDE-R1-8B-Chat

liked a model 6 days ago

perplexity-ai/r1-1776

upvoted an article 6 days ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

View all activity

Organizations

None yet

gary109's activity

upvoted 3 articles 6 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 142

Article

The AI tools for Art Newsletter - Issue 1

27 days ago

• 66

Article

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

•

9 days ago

• 16

upvoted an article 12 days ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 188

upvoted 3 articles 13 days ago

Article

From Files to Chunks: Improving Hugging Face Storage Efficiency

Nov 20, 2024

• 51

Article

Open-source DeepResearch – Freeing our search agents

23 days ago

• 1.1k

Article

Hugging Face x LangChain : A new partner package in LangChain

May 14, 2024

• 132

upvoted an article 14 days ago

Article

Open R1: Update #2

and 6 others •

16 days ago

• 187

upvoted a collection 16 days ago

Breeze 2 Family

Collection

Llama-Breeze2 is a multi-modal language model family specifically intended for Traditional Chinese use. BreezyVoice is a Taiwan Mandarin TTS • 6 items • Updated about 16 hours ago • 17

upvoted an article 26 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

30 days ago

• 776

upvoted a collection about 1 month ago

high-quality Chinese training datasets

Collection

a suite of high-quality Chinese datasets, used for pretraining, fine-tuning or preference alignment. And the models trained on these datasets. • 13 items • Updated 1 day ago • 11

upvoted a paper 3 months ago

TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

Paper • 2411.18671 • Published Nov 27, 2024 • 20

upvoted a collection 4 months ago

LLM2CLIP

Collection

LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 10 items • Updated Jan 8 • 55

upvoted a paper 4 months ago

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published Nov 8, 2024 • 15

upvoted 3 papers 5 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 61

Seeing Faces in Things: A Model and Dataset for Pareidolia

Paper • 2409.16143 • Published Sep 24, 2024 • 17

RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning

Paper • 2409.14674 • Published Sep 23, 2024 • 43

upvoted 3 papers 6 months ago

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 49

Scaling Up Diffusion and Flow-based XGBoost Models

Paper • 2408.16046 • Published Aug 28, 2024 • 10

Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17, 2024 • 22