sthenno/tempesthenno-kto-0205-ckpt80

Update: now checking evaluation results obtained without chat templates.

tempesthenno-icy-0130

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the SCE merge method, with sthenno/tempesthenno-nuslerp-0124 as the base model.
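For readers unfamiliar with SCE, the following is a minimal pure-Python sketch of the idea on a single flat "tensor": select the highest-variance positions of the task vectors, calculate per-model weights from squared magnitudes, and erase entries whose sign disagrees with the majority. It is a simplified illustration under stated assumptions, not mergekit's actual implementation; the function name `sce_merge` and the toy inputs are hypothetical.

```python
from statistics import pvariance


def sce_merge(base, models, select_topk=0.8):
    """Toy SCE-style merge over flat lists of floats (illustrative only)."""
    # Task vectors: each fine-tuned model minus the base model.
    deltas = [[m - b for m, b in zip(model, base)] for model in models]
    n = len(base)

    # Select: keep the top-k fraction of positions by variance across task vectors.
    variances = [pvariance([d[i] for d in deltas]) for i in range(n)]
    k = int(n * select_topk)
    ranked = sorted(range(n), key=lambda i: variances[i], reverse=True)
    keep = set(ranked[:k])
    deltas = [[v if i in keep else 0.0 for i, v in enumerate(d)] for d in deltas]

    # Calculate: weight each model by its share of the squared task-vector mass.
    energy = [sum(v * v for v in d) for d in deltas]
    total = sum(energy)
    if total > 0:
        weights = [e / total for e in energy]
    else:
        weights = [1.0 / len(deltas)] * len(deltas)

    # Erase: per position, drop entries whose sign disagrees with the summed sign,
    # then take a normalized weighted average of what remains.
    merged = []
    for i in range(n):
        column = [d[i] for d in deltas]
        sign = 1.0 if sum(column) >= 0 else -1.0
        agreeing = [(w, v) for w, v in zip(weights, column) if v * sign > 0]
        denom = sum(w for w, _ in agreeing)
        update = sum(w * v for w, v in agreeing) / denom if denom > 0 else 0.0
        merged.append(base[i] + update)
    return merged
```

In this sketch, dividing by the summed weights of the surviving entries plays the role that `normalize: true` plays in the configuration; how mergekit handles ties, zero-variance positions, and per-tensor details may differ.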

Models Merged

The following models were included in the merge:

  • sthenno/tempesthenno-icy-0130-01
  • sthenno/tempesthenno-icy-0130-02
  • sthenno/tempesthenno-icy-0130-03

Configuration

The following YAML configuration was used to produce this model:

name: tempesthenno-icy-0130
merge_method: sce
parameters:
  select_topk: 0.8
  normalize: true
dtype: float32
out_dtype: bfloat16
base_model: sthenno/tempesthenno-nuslerp-0124
tokenizer:
  source: base
chat_template: chatml
models:
  - model: sthenno/tempesthenno-icy-0130-01
  - model: sthenno/tempesthenno-icy-0130-02
  - model: sthenno/tempesthenno-icy-0130-03
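
One way to reproduce the merge is to save the configuration above to a file and pass it to mergekit's `mergekit-yaml` command, which takes a config path and an output directory. The sketch below only writes the file and builds the argument list; the helper name `prepare_merge` and the output directory name are hypothetical, and actually running the command assumes mergekit is installed and the listed models can be downloaded.

```python
from pathlib import Path

# The configuration from this model card, verbatim.
CONFIG = """\
name: tempesthenno-icy-0130
merge_method: sce
parameters:
  select_topk: 0.8
  normalize: true
dtype: float32
out_dtype: bfloat16
base_model: sthenno/tempesthenno-nuslerp-0124
tokenizer:
  source: base
chat_template: chatml
models:
  - model: sthenno/tempesthenno-icy-0130-01
  - model: sthenno/tempesthenno-icy-0130-02
  - model: sthenno/tempesthenno-icy-0130-03
"""


def prepare_merge(workdir, output_name="merged-model"):
    """Write config.yaml into workdir and return the mergekit-yaml argument list."""
    workdir = Path(workdir)
    config_path = workdir / "config.yaml"
    config_path.write_text(CONFIG, encoding="utf-8")
    # mergekit-yaml <config file> <output directory>
    return ["mergekit-yaml", str(config_path), str(workdir / output_name)]
```

The returned list can be handed to `subprocess.run(cmd, check=True)` once the environment is set up; nothing is executed by the sketch itself.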

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric                 Value
Avg.                   39.74
IFEval (0-Shot)        62.18
BBH (3-Shot)           50.10
MATH Lvl 5 (4-Shot)    37.99
GPQA (0-shot)          19.69
MuSR (0-shot)          19.84
MMLU-PRO (5-shot)      48.65
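
As a quick sanity check, the reported average appears to be the unweighted mean of the six benchmark scores. The snippet below recomputes it from the values in the table; the assumption that the leaderboard uses a plain arithmetic mean is inferred from the numbers, not stated in this card.

```python
# Benchmark scores copied from the results table.
scores = {
    "IFEval (0-Shot)": 62.18,
    "BBH (3-Shot)": 50.10,
    "MATH Lvl 5 (4-Shot)": 37.99,
    "GPQA (0-shot)": 19.69,
    "MuSR (0-shot)": 19.84,
    "MMLU-PRO (5-shot)": 48.65,
}

# Unweighted arithmetic mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 39.74
```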

Model size: 14.8B params (Safetensors, tensor type BF16)
