---
base_model:
- mistralai/Mistral-Small-24B-Instruct-2501
- mistralai/Mistral-Small-24B-Base-2501
library_name: transformers
tags:
- mergekit
- merge
---
# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method, with [mistralai/Mistral-Small-24B-Base-2501](https://huggingface.co/mistralai/Mistral-Small-24B-Base-2501) as the base.

### Models Merged

The following models were included in the merge:
* [mistralai/Mistral-Small-24B-Instruct-2501](https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: "mistralai/Mistral-Small-24B-Base-2501"
    parameters:
      weight: 0.7
      density: 1
  - model: "mistralai/Mistral-Small-24B-Instruct-2501"
    parameters:
      weight: 0.3
      density: 1
merge_method: ties
base_model: "mistralai/Mistral-Small-24B-Base-2501"
parameters:
  density: 1
  normalize: true
  int8_mask: true
dtype: bfloat16
output: "Mistral-Small-24B-TIES-Merged"
```
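
To make the `weight`, `density`, and `normalize` settings above more concrete, here is a minimal sketch of the TIES steps (trim, elect sign, merge agreeing entries) on toy NumPy arrays. It follows the description in the TIES paper and is not mergekit's implementation. The function name `ties_merge` and the example values are hypothetical, and the sketch does not model how mergekit treats a base model that also appears in the `models` list.

```python
import numpy as np


def ties_merge(base, finetuned, weights, density=1.0, normalize=True):
    """Toy TIES merge of fine-tuned tensors onto one base tensor (illustrative only)."""
    # Task vectors: what each fine-tuned tensor changed relative to the base.
    deltas = np.stack([ft - base for ft in finetuned]).astype(np.float64)
    w = np.asarray(weights, dtype=np.float64)
    w = w.reshape((-1,) + (1,) * base.ndim)  # broadcast one weight per model

    # Trim: keep only the top `density` fraction of entries by magnitude.
    if density < 1.0:
        keep = int(round(density * base.size))
        for d in deltas:
            drop = np.argsort(np.abs(d), axis=None)[: d.size - keep]
            d.flat[drop] = 0.0

    # Elect sign: per entry, the sign of the weighted sum of task vectors.
    weighted = deltas * w
    elected = np.sign(weighted.sum(axis=0))
    agree = np.sign(weighted) == elected

    # Merge: sum only the entries whose sign matches the elected sign.
    merged_delta = (weighted * agree).sum(axis=0)
    if normalize:
        # Divide by the total weight of the contributions that were kept.
        divisor = (w * agree).sum(axis=0)
        divisor[divisor == 0] = 1.0
        merged_delta = merged_delta / divisor

    return base + merged_delta


# Hypothetical toy tensors standing in for one weight matrix of each model.
base = np.array([1.0, -2.0, 0.5, 0.0])
instruct = np.array([1.5, -2.0, 0.1, 0.4])

merged = ties_merge(base, [instruct], weights=[0.3], density=1.0, normalize=True)
unnormalized = ties_merge(base, [instruct], weights=[0.3], density=1.0, normalize=False)
```

In this toy version, `density: 1` means nothing is trimmed, and normalizing with a single task vector cancels its weight, so `merged` is the base plus the full delta while `unnormalized` is the base plus 0.3 times the delta. Whether mergekit behaves identically for the configuration above has not been verified here.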