README.md · redrix/matricide-12B-Unslop-Unleashed-v2 at a3b1c8400d00ae19f22696fef5530e2f982a9e45

matricide-12B-Unslop-Unleashed-v2 / README.md

redrix

Update README.md

a3b1c84 verified 2 months ago

preview code

raw

history blame

2.36 kB

	---
	base_model:
	- MarinaraSpaghetti/NemoMix-Unleashed-12B
	- TheDrummer/UnslopNemo-12B-v4
	library_name: transformers
	tags:
	- mergekit
	- merge
	- 12b
	- chat
	- roleplay
	- creative-writing
	- NuSLERP
	license: apache-2.0
	---
	# matricide-12B-Unslop-Unleashed-v2
	> Her ‘Love’ only existed to rein in my ambition. The stagnancy became unbearable.

	This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

	This is my sixth model. Also the first working model using the NuSLERP merge method. The original was intended to introduce [UnslopNemo](https://huggingface.co/TheDrummer/UnslopNemo-12B-v4) to combat GPTisms of [NemoMix](https://huggingface.co/MarinaraSpaghetti/NemoMix-Unleashed-12B). I used [UnslopNemo-4](https://huggingface.co/TheDrummer/UnslopNemo-12B-v4) as it supposedly has bigger anti-GPTism effects at the cost of intelligence.

	## Testing stage: early testing
	I do not know how this model holds up over long term context. Early testing showed stability and viable answers.

	## Parameters
	- Context size: Not more than 20k recommended - coherency may degrade.
	- Chat Template: ChatML; Metharme/Pygmalion (as per UnslopNemo) may work, but effects are untested
	- Samplers: A Temperature-Last of 1 and Min-P of 0.1 are viable, but haven't been finetuned. Activate DRY if repetition appears. XTC is untested.

	## Quantization
	Static GGUF Quants available at [redrix/matricide-12B-Unslop-Unleashed-v2-GGUF](https://huggingface.co/redrix/matricide-12B-Unslop-Unleashed-v2-GGUF)

	## Merge Details
	### Merge Method

	This model was merged using the NuSLERP merge method.

	### Models Merged

	The following models were included in the merge:
	* [MarinaraSpaghetti/NemoMix-Unleashed-12B](https://huggingface.co/MarinaraSpaghetti/NemoMix-Unleashed-12B)
	* [TheDrummer/UnslopNemo-12B-v4](https://huggingface.co/TheDrummer/UnslopNemo-12B-v4)

	### Configuration

	The following YAML configuration was used to produce this model:

	```yaml
	models:
	- model: TheDrummer/UnslopNemo-12B-v4
	parameters:
	weight: [0.8, 0.4, 0.3, 0.5, 0.6]
	- model: MarinaraSpaghetti/NemoMix-Unleashed-12B
	parameters:
	weight: [0.2, 0.6, 0.7, 0.5, 0.4]
	merge_method: nuslerp
	dtype: bfloat16
	chat_template: "chatml"
	tokenizer:
	source: union
	parameters:
	normalize: true
	int8_mask: true


	```