Weijing Huang

waleking

AI & ML interests

Language Models

Recent Activity

upvoted an article about 23 hours ago

Replicating DeepSeek R1 for Information Extraction

upvoted a paper 20 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

liked a Space 27 days ago

HuggingFaceH4/blogpost-scaling-test-time-compute

Organizations

None yet

waleking's activity

upvoted an article about 23 hours ago

Article • Replicating DeepSeek R1 for Information Extraction • 3 days ago • 21

upvoted a paper 20 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 22 days ago • 89

upvoted an article about 1 month ago

Article • Deriving DPO's Loss • Dec 24, 2024 • 26