distily/distily_norm_distilgpt2_sweep_extended
Tags: TensorBoard, Safetensors, wikimedia/wikipedia, Distily, gpt2, Generated from Trainer
License: apache-2.0
1 contributor
History: 40 commits
Latest commit: lapp0, "Training in progress, step 123750" (1d4c2a8, verified), 27 days ago
File | Size | Flags | Last commit | Updated
logs | | | Training in progress, step 123750 | 27 days ago
.gitattributes | 1.52 kB | | initial commit | 29 days ago
README.md | 3.7 kB | | Training in progress, step 123750 | 27 days ago
benchmarks.shelve.bak | 0 Bytes | pickle | End of training | 29 days ago
benchmarks.shelve.dat | 0 Bytes | pickle | End of training | 29 days ago
benchmarks.shelve.dir | 0 Bytes | pickle | End of training | 29 days ago
config.json | 1.02 kB | | Training in progress, step 123750 | 27 days ago
generation_config.json | 119 Bytes | | Training in progress, step 123750 | 27 days ago
merges.txt | 456 kB | | End of training | 29 days ago
model.safetensors | 164 MB | LFS | Training in progress, step 123750 | 27 days ago
special_tokens_map.json | 131 Bytes | | End of training | 29 days ago
tokenizer.json | 2.11 MB | | End of training | 29 days ago
tokenizer_config.json | 476 Bytes | | End of training | 29 days ago
training_args.bin | 5.62 kB | pickle, LFS | Training in progress, step 123750 | 27 days ago
vocab.json | 798 kB | | End of training | 29 days ago

Detected Pickle imports (9) in training_args.bin:
"torch.device"
"transformers.trainer_utils.HubStrategy"
"transformers.trainer_utils.SchedulerType"
"distily.args.DistillationTrainingArguments"
"transformers.trainer_utils.IntervalStrategy"
"accelerate.state.PartialState"
"accelerate.utils.dataclasses.DistributedType"
"transformers.trainer_pt_utils.AcceleratorConfig"
"transformers.training_args.OptimizerNames"