Jingcheng Hu's picture

Jingcheng Hu

reign12

·

AI & ML interests

Foundation models and alignment

Recent Activity

liked a model 15 days ago

deepseek-ai/DeepSeek-R1-Zero

liked a model 15 days ago

deepseek-ai/DeepSeek-R1

liked a Space 2 months ago

Qwen/QwQ-32B-preview

View all activity

Organizations

reign12's activity

New activity in Xwin-LM/Xwin-Math-70B-V1.0 8 months ago

Add paper link

#3 opened 8 months ago by

New activity in Xwin-LM/Xwin-LM-70B-V0.1 over 1 year ago

33B when?

#8 opened over 1 year ago by

New activity in OpenAssistant/reward-model-deberta-v3-large-v2 over 1 year ago

Question about evaluating this reward model on Anthropic/hh-rlhf

#4 opened almost 2 years ago by

New activity in OpenAssistant/oasst-rm-2-pythia-6.9b-epoch-1 over 1 year ago

More details on training data for reward model

#2 opened over 1 year ago by

New activity in Dahoas/filtered-SHP over 1 year ago

How is this dataset filtered?

#1 opened over 1 year ago by

New activity in YeungNLP/firefly-train-1.1M almost 2 years ago

大神是怎么收集这么多高质量的数据的啊

#1 opened almost 2 years ago by