Zhaolin Gao's picture

2 1 6

Zhaolin Gao

GitBag

·

https://zhaolingao.github.io/

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a model 8 days ago

GitBag/reasoning_rebel_nianli_lr_3e-7_eta_1e6_1737077907

published a model 8 days ago

GitBag/reasoning_rebel_nianli_lr_3e-7_eta_1e6_1737077907

updated a dataset 8 days ago

GitBag/regenerated_responses_from_base_harvard

View all activity

Articles

RLHF 101: A Technical Dive into RLHF

Organizations

GitBag's activity

New activity in GitBag/multiturn_1_4 4 months ago

Dataset Viewer issue: ResponseNotFound

#1 opened 4 months ago by

New activity in Cornell-AGI/REBEL-Llama-3-epoch_2 8 months ago

model weights

#1 opened 8 months ago by