Model Card for Model ID

microsoft/Phi-3-medium-4k-instruct trained with ORPO trainer.

Training Details

Training Data

mlabonne/orpo-dpo-mix-40k is used for finetuning this model.

[More Information Needed]

Training Procedure

Trained with ORPO trainer, and only first 5K rows are used for finetuning (5K out of 40K).

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	26.84
IFEval (0-Shot)	40.22
BBH (3-Shot)	46.63
MATH Lvl 5 (4-Shot)	16.69
GPQA (0-shot)	7.38
MuSR (0-shot)	10.53
MMLU-PRO (5-shot)	39.60

Downloads last month: 7

Safetensors

Model size

14B params

Tensor type

BF16

Inference Providers NEW

Text2Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for BlackBeenie/Neos-Phi-3-14B-v0.1

Base model

microsoft/Phi-3-medium-4k-instruct

Finetuned

(5)

this model

Quantizations

2 models

Dataset used to train BlackBeenie/Neos-Phi-3-14B-v0.1

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

40.220
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

46.630
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

16.690
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

7.380
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

10.530
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

39.600

View on Papers With Code