zhangwenbin

ExceedZhang

AI & ML interests

None yet

Organizations

None yet

ExceedZhang's activity

upvoted 2 papers about 16 hours ago

Humanity's Last Exam

Paper • 2501.14249 • Published 6 days ago • 44

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 4 days ago • 38

upvoted 2 articles 2 days ago

We now support VLMs in smolagents!

Article • 6 days ago • 63

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 2 days ago • 406

liked 2 models 2 days ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 2 days ago • 18.4k • 185

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 2 days ago • 2.85k • 152

upvoted a paper 7 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 8 days ago • 262

updated a model 8 days ago

ExceedZhang/DeepSeek-R1-Distill-Qwen-14B-W4A16-G128

Updated 8 days ago • 4

published a model 8 days ago

ExceedZhang/DeepSeek-R1-Distill-Qwen-14B-W4A16-G128

Updated 8 days ago • 4

liked a model 9 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 4 days ago • 165k • 718

upvoted a paper 15 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 15 days ago • 270

liked a model 15 days ago

microsoft/phi-4

Text Generation • Updated 21 days ago • 265k • 1.61k

upvoted a paper 20 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 22 days ago • 249

liked a model 21 days ago

ICTNLP/llava-mini-llama-3.1-8b

Image-Text-to-Text • Updated 17 days ago • 6.47k • 37

upvoted 2 papers 22 days ago

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published 26 days ago • 42

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 26 days ago • 89

liked a model 22 days ago

nvidia/Cosmos-1.0-Diffusion-7B-Text2World

Updated 20 days ago • 217k • 195

liked a model 23 days ago

VITA-MLLM/VITA-1.5

Video-Text-to-Text • Updated 14 days ago • 713 • 33

liked 2 models 24 days ago

FunAudioLLM/CosyVoice2-0.5B

Text-to-Speech • Updated Dec 27, 2024 • 7.58k • 50

Qwen/Qwen2.5-Coder-32B-Instruct-GPTQ-Int4

Text Generation • Updated Nov 18, 2024 • 23.9k • 12