grimjim
/

kuno-kunoichi-v1-DPO-v2-SLERP-7B-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

kuno-kunoichi-v1-DPO-v2-SLERP-7B-GGUF / README.md

grimjim's picture

Update README.md

04fef2f verified 6 months ago

|

history blame contribute delete

1.64 kB

	---
	base_model: grimjim/kuno-kunoichi-v1-DPO-v2-SLERP-7B
	library_name: transformers
	quanted_by: grimjim
	license: cc-by-nc-4.0
	pipeline_tag: text-generation
	---
	# kuno-kunoichi-v1-DPO-v2-SLERP-7B-GGUF

	This repo contains GGUF quants of [kuno-kunoichi-v1-DPO-v2-SLERP-7B](https://huggingface.co/grimjim/kuno-kunoichi-v1-DPO-v2-SLERP-7B).

	kuno-kunoichi-v1-DPO-v2-SLERP-7B is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
	The expectation is that the result is more robust against errors or when merging due to "denseness", as the two models likely implement comparable reasoning at least somewhat differently.

	Tested lightly with ChatML instruct format prompting using temperature=1.1 and minP=0.03. The model natively supports Alpaca format prompts.

	- [GGUF-IQ-Imatrix quants helpfully provided by Lewdiculous.](https://huggingface.co/Lewdiculous/kuno-kunoichi-v1-DPO-v2-SLERP-7B-GGUF-IQ-Imatrix)

	## Merge Details
	### Merge Method

	This model was merged using the SLERP merge method.

	### Models Merged

	The following models were included in the merge:
	* [SanjiWatsuki/Kunoichi-7B](https://huggingface.co/SanjiWatsuki/Kunoichi-7B)
	* [SanjiWatsuki/Kunoichi-DPO-v2-7B](https://huggingface.co/SanjiWatsuki/Kunoichi-DPO-v2-7B)

	### Configuration

	The following YAML configuration was used to produce this model:

	```yaml
	slices:
	- sources:
	- model: SanjiWatsuki/Kunoichi-7B
	layer_range: [0,32]
	- model: SanjiWatsuki/Kunoichi-DPO-v2-7B
	layer_range: [0,32]
	merge_method: slerp
	base_model: SanjiWatsuki/Kunoichi-7B
	parameters:
	t:
	- value: 0.5
	dtype: float16

	```