Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
1
6
Zhaolin Gao
GitBag
Follow
vinhnx90's profile picture
kirankc's profile picture
dark-pen's profile picture
3 followers
·
0 following
https://zhaolingao.github.io/
AI & ML interests
Reinforcement Learning from Human Feedback
Recent Activity
updated
a model
8 days ago
GitBag/reasoning_rebel_nianli_lr_3e-7_eta_1e6_1737077907
published
a model
8 days ago
GitBag/reasoning_rebel_nianli_lr_3e-7_eta_1e6_1737077907
updated
a dataset
8 days ago
GitBag/regenerated_responses_from_base_harvard
View all activity
Articles
RLHF 101: A Technical Dive into RLHF
Dec 11, 2024
•
5
Organizations
GitBag
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
GitBag/multiturn_1_4
4 months ago
Dataset Viewer issue: ResponseNotFound
1
#1 opened 4 months ago by
GitBag
New activity in
Cornell-AGI/REBEL-Llama-3-epoch_2
8 months ago
model weights
1
#1 opened 8 months ago by
maldv