Model Card for Model ID

Model trained based on deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B by Self-Calibration proposed by Efficient Test-Time Scaling via Self-Calibration.

Model Sources

Citation

BibTeX:

@misc{huang2025efficienttesttimescalingselfcalibration,
      title={Efficient Test-Time Scaling via Self-Calibration}, 
      author={Chengsong Huang and Langlin Huang and Jixuan Leng and Jiacheng Liu and Jiaxin Huang},
      year={2025},
      eprint={2503.00031},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2503.00031}, 
}

Model Card Contact

[email protected]

Downloads last month
243
Safetensors
Model size
1.78B params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for HINT-lab/DeepSeek-R1-Distill-Qwen-1.5B-Self-Calibration

Finetuned
(144)
this model

Dataset used to train HINT-lab/DeepSeek-R1-Distill-Qwen-1.5B-Self-Calibration

Collection including HINT-lab/DeepSeek-R1-Distill-Qwen-1.5B-Self-Calibration