mehmetkeremturkcan/SmollerLM-20M-Instruct-Pruned-sft5-dpo3

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

1 contributor

History: 24 commits

mehmetkeremturkcan

Training in progress, step 11649

00148cc verified about 8 hours ago

| File | Size | Last commit | Updated |
|---|---|---|---|
| runs | | Training in progress, step 11649 | about 8 hours ago |
| .gitattributes | 1.52 kB | initial commit | about 13 hours ago |
| config.json | 954 Bytes | Training in progress, step 1000 | about 13 hours ago |
| merges.txt | 466 kB | Training in progress, step 1000 | about 13 hours ago |
| model.safetensors | 57.1 MB (LFS) | Training in progress, step 11649 | about 8 hours ago |
| special_tokens_map.json | 655 Bytes | Training in progress, step 1000 | about 13 hours ago |
| tokenizer.json | 3.52 MB | Training in progress, step 1000 | about 13 hours ago |
| tokenizer_config.json | 3.84 kB | Training in progress, step 1000 | about 13 hours ago |
| training_args.bin | 6.33 kB (LFS) | Training in progress, step 1000 | about 13 hours ago |
| vocab.json | 801 kB | Training in progress, step 1000 | about 13 hours ago |

training_args.bin: Detected Pickle imports (11)
- transformers.trainer_utils.IntervalStrategy
- transformers.training_args.OptimizerNames
- trl.trainer.dpo_config.DPOConfig
- trl.trainer.dpo_config.FDivergenceType
- transformers.trainer_pt_utils.AcceleratorConfig
- accelerate.state.PartialState
- torch.device
- transformers.trainer_utils.HubStrategy
- transformers.trainer_utils.SchedulerType
- transformers.trainer_utils.SaveStrategy
- accelerate.utils.dataclasses.DistributedType
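
The pickle imports listed for training_args.bin can be checked locally without executing the file. The sketch below is one possible approach using only the Python standard library: it walks the pickle opcodes and collects the module and class names the stream would import. It is not the scanner the hosting site uses, and the helper names `list_pickle_imports` and `pickle_payload` are hypothetical. It also assumes that a file written by `torch.save` is a zip archive containing a `data.pkl` member, which should be verified for the file at hand.

```python
import io
import pickle
import pickletools
import zipfile
from collections import OrderedDict

# Opcodes that push a text string onto the pickle stack.
_STRING_OPS = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}


def list_pickle_imports(data: bytes) -> list[str]:
    """Return sorted 'module.name' references in a pickle stream, without unpickling it."""
    found = set()
    strings = []  # string values pushed so far, in order
    memo = {}     # memo index -> string, or None for anything else
    prev = None   # string pushed by the immediately preceding opcode, if any
    for opcode, arg, _pos in pickletools.genops(data):
        name = opcode.name
        current = None
        if name in _STRING_OPS:
            strings.append(arg)
            current = arg
        elif name == "MEMOIZE":
            memo[len(memo)] = prev
        elif name in ("PUT", "BINPUT", "LONG_BINPUT"):
            memo[arg] = prev
        elif name in ("GET", "BINGET", "LONG_BINGET"):
            value = memo.get(arg)
            if isinstance(value, str):
                strings.append(value)
                current = value
        elif name in ("GLOBAL", "INST"):
            # Older protocols carry "module name" directly in the opcode argument.
            module, qualname = arg.split(" ", 1)
            found.add(f"{module}.{qualname}")
        elif name == "STACK_GLOBAL" and len(strings) >= 2:
            # Protocol 4+: module and name are the two most recently pushed strings.
            found.add(f"{strings[-2]}.{strings[-1]}")
        prev = current
    return sorted(found)


def pickle_payload(raw: bytes) -> bytes:
    """Return the inner data.pkl if raw is a zip archive (assumed torch.save layout), else raw."""
    buf = io.BytesIO(raw)
    if zipfile.is_zipfile(buf):
        with zipfile.ZipFile(buf) as zf:
            for member in zf.namelist():
                if member.endswith("data.pkl"):
                    return zf.read(member)
    return raw


if __name__ == "__main__":
    # Stand-in payload; for a real check, read the bytes of training_args.bin instead.
    sample = pickle.dumps(OrderedDict(a=1), protocol=4)
    for ref in list_pickle_imports(pickle_payload(sample)):
        print(ref)
```

Because the opcodes are only read and never executed, this kind of inspection is safe to run on an untrusted file, whereas loading the file with `pickle.load` or `torch.load` is not.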