Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
amirabdullah19852020
/
gpt-neo-125m_hh_reward
like
0
Text Generation
Transformers
Safetensors
gpt_neo
trl
dpo
Generated from Trainer
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
460c39e
gpt-neo-125m_hh_reward
1 contributor
History:
84 commits
amirabdullah19852020
Update README.md
460c39e
11 months ago
.gitattributes
1.52 kB
initial commit
11 months ago
README.md
3.75 kB
Update README.md
11 months ago
config.json
1.11 kB
End of training
11 months ago
generation_config.json
119 Bytes
End of training
11 months ago
merges.txt
456 kB
End of training
11 months ago
model.safetensors
501 MB
LFS
End of training
11 months ago
special_tokens_map.json
470 Bytes
End of training
11 months ago
tokenizer.json
2.11 MB
End of training
11 months ago
tokenizer_config.json
525 Bytes
End of training
11 months ago
training_args.bin
pickle
Detected Pickle imports (8)
"transformers.trainer_utils.SchedulerType"
,
"accelerate.state.PartialState"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.training_args.OptimizerNames"
,
"transformers.training_args.TrainingArguments"
,
"torch.device"
,
"accelerate.utils.dataclasses.DistributedType"
How to fix it?
4.28 kB
LFS
Training in progress, step 500
11 months ago
vocab.json
798 kB
End of training
11 months ago