Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
jtatman
/
felladrin-tinymistral-248m-v4-dpo
like
0
Text Generation
Transformers
Safetensors
argilla/distilabel-intel-orca-dpo-pairs
mistral
DPO
reasoning
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
Edit model card
Model Card for felladrin-tinymistral-248m-v4-dpo
Model Details
Model Description
Model Card for felladrin-tinymistral-248m-v4-dpo
SFT model trained with orca DPO
Model Details
Model Description
Experimental.
ChatML format.
Downloads last month
8
Safetensors
Model size
248M params
Tensor type
F32
·
Inference Examples
Text Generation
Inference API (serverless) is not available, repository is disabled.
Dataset used to train
jtatman/felladrin-tinymistral-248m-v4-dpo
argilla/distilabel-intel-orca-dpo-pairs
Viewer
•
Updated
Feb 5
•
12.9k
•
2.14k
•
161