60 24 86

Chujie Zheng

chujiezheng

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

authored a paper 5 days ago

Aligning Instruction Tuning with Pre-training

authored a paper 5 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

commented on a paper 5 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

View all activity

Organizations

chujiezheng's activity

authored 2 papers 5 days ago

Aligning Instruction Tuning with Pre-training

Paper • 2501.09368 • Published Jan 16

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 5 days ago • 91

commented a paper 5 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 5 days ago • 91 •

upvoted a paper 5 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 5 days ago • 91

liked 5 models 21 days ago

liked 3 Spaces 21 days ago

Qwen2.5 VL 72B Instruct

💻

Interact with Qwen2.5-VL-72B to get responses and generate images

Qwen2.5-1M Demo

💻

Upload documents to answer questions

539

Qwen2.5 Max Demo

🐢

Chat with an AI language model

upvoted a paper about 1 month ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 63

authored a paper about 1 month ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 91

commented a paper about 1 month ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 91 •

upvoted a paper about 1 month ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 91

commented a paper about 1 month ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 91 •

upvoted a paper about 1 month ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 70

liked 2 models about 1 month ago

Qwen/Qwen2.5-Math-7B-PRM800K

Text Classification • Updated Jan 17 • 2.58k • 13

Qwen/Qwen2.5-Math-PRM-72B

Text Classification • Updated Jan 17 • 987 • 68