Xin Li

lixin67

WilliamLeeBravo

AI & ML interests

None yet

Recent Activity

upvoted an article 6 days ago

Open R1: Update #2

upvoted an article 6 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

liked a Space 6 days ago

nanotron/ultrascale-playbook

View all activity

Organizations

None yet

lixin67's activity

upvoted 2 articles 6 days ago

Article

Open R1: Update #2

and 6 others •

15 days ago

• 185

Article

Open-R1: a fully open reproduction of DeepSeek-R1

29 days ago

• 773

liked a Space 6 days ago

1.59k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 7 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 142

liked 3 datasets 21 days ago

upvoted 2 papers about 2 months ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 37

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 65

liked a model 2 months ago

Qwen/QVQ-72B-Preview

Image-Text-to-Text • Updated Jan 12 • 163k • • 548

liked a dataset 2 months ago

OpenGVLab/MMPR-v1.1

Preview • Updated Dec 21, 2024 • 591 • 40

liked a model 2 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 2 days ago • 453k • 1.58k

liked a dataset 2 months ago

HuggingFaceTB/finemath

Viewer • Updated 20 days ago • 48.3M • 12.5k • 285

upvoted a collection 2 months ago

Open Image Preferences

Collection

Containing all artifacts for the Stable Diffusion 3.5L vs Flux Dev image preference community sprint. • 14 items • Updated Dec 19, 2024 • 9

liked a model 2 months ago

peiyi9979/math-shepherd-mistral-7b-prm

Text Generation • Updated Jan 15, 2024 • 3.05k • 45

upvoted a paper 2 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 56

liked a Space 2 months ago

522

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

liked 3 datasets 2 months ago

GAIR/o1-journey

Viewer • Updated Oct 16, 2024 • 327 • 290 • 132

Xkev/LLaVA-CoT-100k

Viewer • Updated Nov 27, 2024 • 98.6k • 3.39k • 74

O1-OPEN/OpenO1-SFT-Ultra

Viewer • Updated Dec 17, 2024 • 28M • 475 • 53