Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
amirabdullah19852020
/
gpt-neo-125m_hh_reward
like
0
Text Generation
Transformers
Safetensors
gpt_neo
trl
dpo
Generated from Trainer
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
gpt-neo-125m_hh_reward
/
tokenizer.json
amirabdullah19852020
End of training
411a7fb
11 months ago
raw
Copy download link
history
contribute
delete
Safe
2.11 MB
File too large to display, you can
check the raw version
instead.