Model Card for Model ID

microsoft/Phi-3-medium-4k-instruct trained with ORPO trainer.

Training Details

Training Data

mlabonne/orpo-dpo-mix-40k is used for finetuning this model.

[More Information Needed]

Training Procedure

Trained with ORPO trainer, and only first 5K rows are used for finetuning (5K out of 40K).

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 26.84
IFEval (0-Shot) 40.22
BBH (3-Shot) 46.63
MATH Lvl 5 (4-Shot) 16.69
GPQA (0-shot) 7.38
MuSR (0-shot) 10.53
MMLU-PRO (5-shot) 39.60
Downloads last month
7
Safetensors
Model size
14B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for BlackBeenie/Neos-Phi-3-14B-v0.1

Finetuned
(5)
this model
Quantizations
2 models

Dataset used to train BlackBeenie/Neos-Phi-3-14B-v0.1

Evaluation results