--- base_model: - arcee-ai/Virtuoso-Small - deepseek-ai/DeepSeek-R1-Distill-Qwen-14B - Qwen/Qwen2.5-Coder-14B - sthenno/tempesthenno-ppo-ckpt40 - sthenno-com/miscii-14b-1225 library_name: transformers tags: - mergekit - merge --- # merge This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [sthenno-com/miscii-14b-1225](https://huggingface.co/sthenno-com/miscii-14b-1225) as a base. ### Models Merged The following models were included in the merge: * [arcee-ai/Virtuoso-Small](https://huggingface.co/arcee-ai/Virtuoso-Small) * [deepseek-ai/DeepSeek-R1-Distill-Qwen-14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B) * [Qwen/Qwen2.5-Coder-14B](https://huggingface.co/Qwen/Qwen2.5-Coder-14B) * [sthenno/tempesthenno-ppo-ckpt40](https://huggingface.co/sthenno/tempesthenno-ppo-ckpt40) ### Configuration The following YAML configuration was used to produce this model: ```yaml merge_method: model_stock models: - model: sthenno/tempesthenno-ppo-ckpt40 parameters: weight: 1.0 - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B parameters: weight: 1.0 - model: sthenno/tempesthenno-ppo-ckpt40 parameters: weight: 1.0 - model: arcee-ai/Virtuoso-Small parameters: weight: 1.0 - model: Qwen/Qwen2.5-Coder-14B parameters: weight: 1.0 base_model: sthenno-com/miscii-14b-1225 tokenizer_source: base dtype: bfloat16 normalize: true ```