Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
amirabdullah19852020
/
gpt-neo-125m_hh_reward
like
0
Text Generation
Transformers
Safetensors
gpt_neo
trl
dpo
Generated from Trainer
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
03b48cd
gpt-neo-125m_hh_reward
/
model.safetensors
Commit History
Training in progress, step 6000
159b197
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 5500
3fa9831
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 5000
09c2244
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 4500
48a1538
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 4000
5af2551
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 3500
b51eff9
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 3000
572b75b
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 2500
2e08d99
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 2000
978daa3
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 1500
71ef745
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 1000
33dc731
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 500
4a11258
amirabdullah19852020
commited on
Dec 31, 2023
End of training
411a7fb
amirabdullah19852020
commited on
Dec 31, 2023
Previous
1
2
3
Next