Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
amirabdullah19852020
/
gpt-neo-125m_hh_reward
like
0
Text Generation
Transformers
Safetensors
gpt_neo
trl
dpo
Generated from Trainer
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
556cfc3
gpt-neo-125m_hh_reward
/
training_args.bin
Commit History
Training in progress, step 500
8b8b241
amirabdullah19852020
commited on
Dec 31, 2023
Training in progress, step 500
4a11258
amirabdullah19852020
commited on
Dec 31, 2023
End of training
411a7fb
amirabdullah19852020
commited on
Dec 31, 2023