Jingcheng Hu
reign12
AI & ML interests
Foundation models and alignment
Recent Activity
liked
a model
15 days ago
deepseek-ai/DeepSeek-R1-Zero
liked
a model
15 days ago
deepseek-ai/DeepSeek-R1
liked
a Space
2 months ago
Qwen/QwQ-32B-preview
Organizations
reign12's activity
Add paper link
#3 opened 8 months ago
by
AdinaY
33B when?
2
#8 opened over 1 year ago
by
nova434431
Question about evaluating this reward model on Anthropic/hh-rlhf
1
#4 opened almost 2 years ago
by
songff
More details on training data for reward model
#2 opened over 1 year ago
by
reign12
How is this dataset filtered?
#1 opened over 1 year ago
by
reign12
大神是怎么收集这么多高质量的数据的啊
3
#1 opened almost 2 years ago
by
leonall