merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

We're all stories in the end

Merge Method

This model was merged using the SLERP merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: Krystalan/DRT-o1-14B
  - model: v000000/Qwen2.5-Lumen-14B
merge_method: slerp
base_model: v000000/Qwen2.5-Lumen-14B
dtype: bfloat16
parameters:
  t: [0, 0.5, 1, 0.5, 0]

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	37.56
IFEval (0-Shot)	46.67
BBH (3-Shot)	48.91
MATH Lvl 5 (4-Shot)	45.77
GPQA (0-shot)	16.44
MuSR (0-shot)	19.88
MMLU-PRO (5-shot)	47.67

Downloads last month: 34

Safetensors

Model size

14.8B params

Tensor type

BF16

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for Triangle104/Herodotos-14B

Krystalan/DRT-o1-14B

v000000/Qwen2.5-Lumen-14B

Merge model

this model

Quantizations

4 models

Collections including Triangle104/Herodotos-14B

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

46.670
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

48.910
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

45.770
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

16.440
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

19.880
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

47.670

View on Papers With Code