Zhaolin Gao's picture

2 1 6

Zhaolin Gao

GitBag

·

https://zhaolingao.github.io/

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a model 8 days ago

GitBag/reasoning_rebel_nianli_lr_3e-7_eta_1e6_1737077907

published a model 8 days ago

GitBag/reasoning_rebel_nianli_lr_3e-7_eta_1e6_1737077907

updated a dataset 8 days ago

GitBag/regenerated_responses_from_base_harvard

View all activity

Articles

RLHF 101: A Technical Dive into RLHF

Organizations

GitBag's activity

liked 3 models 4 months ago

Cornell-AGI/REBEL-Llama-3-Armo-iter_1

Updated Sep 2, 2024 • 4 • 1

Cornell-AGI/REBEL-Llama-3-Armo-iter_2

Updated Sep 2, 2024 • 2 • 2

Cornell-AGI/REBEL-Llama-3-Armo-iter_3

Updated Sep 2, 2024 • 6 • 2

liked a model 7 months ago

Cornell-AGI/REBEL-Llama-3-epoch_2

Text Generation • Updated Sep 1, 2024 • 32 • 3

liked 2 models 8 months ago

Cornell-AGI/REBEL-OpenChat-3.5

Text Generation • Updated Sep 1, 2024 • 17 • 1

Cornell-AGI/REBEL-Llama-3

Text Generation • Updated Sep 1, 2024 • 21 • 1