Hugging Face
Zhaolin Gao
GitBag
3 followers · 0 following
https://zhaolingao.github.io/
AI & ML interests
Reinforcement Learning from Human Feedback
Recent Activity
Updated a model 8 days ago: GitBag/reasoning_rebel_nianli_lr_3e-7_eta_1e6_1737077907
Published a model 8 days ago: GitBag/reasoning_rebel_nianli_lr_3e-7_eta_1e6_1737077907
Updated a dataset 8 days ago: GitBag/regenerated_responses_from_base_harvard
Articles
RLHF 101: A Technical Dive into RLHF (Dec 11, 2024) • 5
Organizations
GitBag's activity
Liked 3 models 4 months ago
Cornell-AGI/REBEL-Llama-3-Armo-iter_1 • Updated Sep 2, 2024 • 4 • 1
Cornell-AGI/REBEL-Llama-3-Armo-iter_2 • Updated Sep 2, 2024 • 2 • 2
Cornell-AGI/REBEL-Llama-3-Armo-iter_3 • Updated Sep 2, 2024 • 6 • 2
Liked a model 7 months ago
Cornell-AGI/REBEL-Llama-3-epoch_2 • Text Generation • Updated Sep 1, 2024 • 32 • 3
Liked 2 models 8 months ago
Cornell-AGI/REBEL-OpenChat-3.5 • Text Generation • Updated Sep 1, 2024 • 17 • 1
Cornell-AGI/REBEL-Llama-3 • Text Generation • Updated Sep 1, 2024 • 21 • 1