---
base_model:
- princeton-nlp/gemma-2-9b-it-SimPO
- TheDrummer/Gemmasutra-9B-v1.1
library_name: transformers
tags:
- mergekit
- merge
---
# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [SLERP](https://en.wikipedia.org/wiki/Slerp) merge method.

### Models Merged

The following models were included in the merge:
* [princeton-nlp/gemma-2-9b-it-SimPO](https://huggingface.co/princeton-nlp/gemma-2-9b-it-SimPO)
* [TheDrummer/Gemmasutra-9B-v1.1](https://huggingface.co/TheDrummer/Gemmasutra-9B-v1.1)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
slices:
- sources:
  - model: TheDrummer/Gemmasutra-9B-v1.1
    layer_range:
    - 0
    - 42
  - model: princeton-nlp/gemma-2-9b-it-SimPO
    layer_range:
    - 0
    - 42
merge_method: slerp
base_model: TheDrummer/Gemmasutra-9B-v1.1
parameters:
  t:
  - filter: self_attn
    value:
    - 0.2
    - 0.4
    - 0.6
    - 0.2
    - 0.4
  - filter: mlp
    value:
    - 0.8
    - 0.6
    - 0.4
    - 0.8
    - 0.6
  - value: 0.4
dtype: bfloat16
```
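
For readers unfamiliar with SLERP, the following is a minimal, dependency-free sketch of the idea behind the configuration above. It is not mergekit's implementation. The helper names `slerp` and `layer_t` are hypothetical. The sketch assumes that `t = 0` yields the `base_model` weights and `t = 1` yields the other model's weights, and that a list of `t` values is spread across the layer range by piecewise-linear interpolation; both assumptions should be checked against the mergekit documentation.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two flat weight vectors.

    Illustrative only: real merges operate on tensors, not Python lists.
    """
    norm0 = math.sqrt(sum(a * a for a in v0))
    norm1 = math.sqrt(sum(b * b for b in v1))
    # Fall back to plain linear interpolation for (near-)zero vectors.
    if norm0 < eps or norm1 < eps:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    # Angle between the two vectors, from their normalized dot product.
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))
    omega = math.acos(dot)
    sin_omega = math.sin(omega)

    # Nearly parallel vectors: SLERP degenerates, so interpolate linearly.
    if abs(sin_omega) < eps:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    w0 = math.sin((1 - t) * omega) / sin_omega
    w1 = math.sin(t * omega) / sin_omega
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def layer_t(values, layer, num_layers):
    """Assumed gradient rule: interpolate anchor values across the layers."""
    if num_layers == 1 or len(values) == 1:
        return values[0]
    pos = layer / (num_layers - 1) * (len(values) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(values) - 1)
    frac = pos - lo
    return (1 - frac) * values[lo] + frac * values[hi]


# Anchor values for the self_attn filter, copied from the configuration.
SELF_ATTN_T = [0.2, 0.4, 0.6, 0.2, 0.4]

if __name__ == "__main__":
    for layer in (0, 10, 20, 30, 41):
        print(layer, round(layer_t(SELF_ATTN_T, layer, 42), 3))
    print(slerp(0.5, [1.0, 0.0], [0.0, 1.0]))
```

Under these assumptions, the self-attention weights stay closer to the base model in the first and fourth fifths of the network (`t` near 0.2), while the MLP weights follow the complementary schedule and lean toward the other model there (`t` near 0.8). Tensors matching neither filter use the fixed default of 0.4.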