	---
	tags:
	- merge
	- mergekit
	- lazymergekit
	- not-for-all-audiences
	- nsfw
	- rp
	- roleplay
	- role-play
	license: llama3
	language:
	- en
	library_name: transformers
	pipeline_tag: text-generation
	base_model:
	- Sao10K/L3-8B-Stheno-v3.2
	- Hastagaras/Jamet-8B-L3-MK.V-Blackroot
	- grimjim/Llama-3-Oasis-v1-OAS-8B
	- Casual-Autopsy/SOVL-MopeyMule-8B
	- Casual-Autopsy/MopeyMule-Blackroot-8B
	- ResplendentAI/Theory_of_Mind_Llama3
	- ResplendentAI/RP_Format_QuoteAsterisk_Llama3
	- ResplendentAI/Smarts_Llama3
	- Casual-Autopsy/L3-Umbral-Mind-RP-v0.3-8B
	- Casual-Autopsy/Llama-3-Mopeyfied-Psychology-8B
	- Hastagaras/Halu-8B-Llama3-Blackroot
	model-index:
	- name: L3-Umbral-Mind-RP-v1.0-8B
	  results:
	  - task:
	      type: text-generation
	      name: Text Generation
	    dataset:
	      name: Unruly
	      type: Unruly
	    metrics:
	    - name: accuracy
	      type: acc
	      value: 54.6
	    source:
	      name: UGI Leaderboard
	      url: https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard
	  - task:
	      type: text-generation
	      name: Text Generation
	    dataset:
	      name: Internet
	      type: Internet
	    metrics:
	    - name: accuracy
	      type: acc
	      value: 31.4
	    source:
	      name: UGI Leaderboard
	      url: https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard
	  - task:
	      type: text-generation
	      name: Text Generation
	    dataset:
	      name: CrimeStats
	      type: CrimeStats
	    metrics:
	    - name: accuracy
	      type: acc
	      value: 45
	    source:
	      name: UGI Leaderboard
	      url: https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard
	  - task:
	      type: text-generation
	      name: Text Generation
	    dataset:
	      name: Stories/Jokes
	      type: Stories/Jokes
	    metrics:
	    - name: accuracy
	      type: acc
	      value: 56.8
	    source:
	      name: UGI Leaderboard
	      url: https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard
	  - task:
	      type: text-generation
	      name: Text Generation
	    dataset:
	      name: PolContro
	      type: PolContro
	    metrics:
	    - name: accuracy
	      type: acc
	      value: 66.7
	    source:
	      name: UGI Leaderboard
	      url: https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard
	  - task:
	      type: text-generation
	      name: Text Generation
	    dataset:
	      name: W/10
	      type: W/10
	    metrics:
	    - name: willingness to answer
	      type: willingness to answer
	      value: 8
	    source:
	      name: UGI Leaderboard
	      url: https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard
	---

	<img src="https://huggingface.co/Casual-Autopsy/L3-Umbral-Mind-RP-v3-8B/resolve/main/63073798_p0_master1200.jpg" style="display: block; margin: auto;">
	Image by ろ47

	Highest ranked 8B model on the [UGI Leaderboard](https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard) as of writing this!

	# Merge

	This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

	## Merge Details

	The goal of this merge was to make an RP model better suited for role-plays with heavy themes such as, but not limited to:
	- Mental illness
	- Self-harm
	- Trauma
	- Suicide

	I hated how RP models tended to be overly positive and hopeful with role-plays involving such themes,
	but thanks to [failspy/Llama-3-8B-Instruct-MopeyMule](https://huggingface.co/failspy/Llama-3-8B-Instruct-MopeyMule) this problem has been lessened considerably.

	If you're an enjoyer of savior/reverse savior type role-plays like myself, then this model is for you.

	### Usage Info

	This model is meant to be used with the asterisks/quotes RP format. Any other format is likely to cause issues.
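For illustration, the asterisks/quotes format is commonly understood to wrap actions and narration in `*asterisks*` and spoken dialogue in `"quotes"`. The Python sketch below is an assumption about that convention, not something shipped with this model, and the helper name `looks_like_asterisk_quote` is hypothetical. It only checks that all of the text in a message sits inside one of the two delimiters.

```python
import re

# Matches either *action text* or "spoken text" (straight double quotes only).
_SEGMENT = re.compile(r'\*[^*\n]+\*|"[^"\n]+"')


def looks_like_asterisk_quote(message: str) -> bool:
    """Return True if every non-space character is inside *...* or "..."."""
    if not message.strip():
        return False
    # Remove every well-formed segment; anything left over is unformatted text.
    leftover = _SEGMENT.sub("", message)
    return leftover.strip() == ""


good = '*She stares at the floor.* "I don\'t want to talk about it."'
bad = "She stares at the floor. I don't want to talk about it."

print(looks_like_asterisk_quote(good))
print(looks_like_asterisk_quote(bad))
```

A check like this could be used to spot character cards or first messages that mix in plain, undelimited narration before starting a role-play.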

	### Quants

	- [Static GGUF quants](https://huggingface.co/mradermacher/L3-Umbral-Mind-RP-v1.0-8B-GGUF) by mradermacher
	- [Imatrix GGUF quants](https://huggingface.co/mradermacher/L3-Umbral-Mind-RP-v1.0-8B-i1-GGUF) by mradermacher
	- [Imatrix GGUF quants](https://huggingface.co/Lewdiculous/L3-Umbral-Mind-RP-v1.0-8B-GGUF-IQ-Imatrix) by Lewdiculous
	- exl2 quants by riveRiPH:
	  - [L3-Umbral-Mind-RP-v1.0-8B-8bpw-h8-exl2](https://huggingface.co/riveRiPH/L3-Umbral-Mind-RP-v1.0-8B-8bpw-h8-exl2)
	  - [L3-Umbral-Mind-RP-v1.0-8B-6.3bpw-h8-exl2](https://huggingface.co/riveRiPH/L3-Umbral-Mind-RP-v1.0-8B-6.3bpw-h8-exl2)
	  - [L3-Umbral-Mind-RP-v1.0-8B-5.3bpw-h6-exl2](https://huggingface.co/riveRiPH/L3-Umbral-Mind-RP-v1.0-8B-5.3bpw-h6-exl2)

	### Merge Method

	This model was built from several Task Arithmetic merges that were tied together with a Model Stock merge, followed by a final Task Arithmetic merge with a model containing psychology data.
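As a rough illustration of the Task Arithmetic step, the sketch below applies the usual formulation, merged = base + sum of weight_i * (model_i - base), to toy three-element vectors. This is not mergekit's implementation: the function name `task_arithmetic` and all of the numbers are made up for the example, and a real merge operates tensor by tensor on full checkpoints. The weights are left unnormalized, on the assumption that this mirrors the `normalize: False` setting in the configurations further down.

```python
from typing import Dict, List

Vector = List[float]


def task_arithmetic(base: Vector, models: Dict[str, Vector],
                    weights: Dict[str, float]) -> Vector:
    """Add weighted deltas (model - base) onto the base, without normalizing."""
    merged = list(base)
    for name, params in models.items():
        w = weights[name]
        for i, (p, b) in enumerate(zip(params, base)):
            merged[i] += w * (p - b)
    return merged


# Toy "checkpoints" (hypothetical values, three parameters each).
base = [1.0, 2.0, 3.0]
models = {
    "base_copy": [1.0, 2.0, 3.0],  # identical to the base, so its delta is zero
    "model_a": [2.0, 2.0, 1.0],
    "model_b": [1.0, 4.0, 3.0],
}
weights = {"base_copy": 0.65, "model_a": 0.25, "model_b": 0.1}

merged = task_arithmetic(base, models, weights)
print(merged)
```

Under this formulation a model that is identical to the base contributes a zero delta, so its weight has no effect on the result.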

	### Models Merged

	The following models were included in the merge:
	* [Sao10K/L3-8B-Stheno-v3.2](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2)
	* [Hastagaras/Halu-8B-Llama3-Blackroot](https://huggingface.co/Hastagaras/Halu-8B-Llama3-Blackroot)
	* [Casual-Autopsy/Llama-3-Mopeyfied-Psychology-8B](https://huggingface.co/Casual-Autopsy/Llama-3-Mopeyfied-Psychology-8B)
	* [Casual-Autopsy/L3-Umbral-Mind-RP-v0.3-8B](https://huggingface.co/Casual-Autopsy/L3-Umbral-Mind-RP-v0.3-8B)
	  * Casual-Autopsy/Umbral-v3-1 + [ResplendentAI/Theory_of_Mind_Llama3](https://huggingface.co/ResplendentAI/Theory_of_Mind_Llama3)
	    * [Sao10K/L3-8B-Stheno-v3.2](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2)
	    * [Casual-Autopsy/SOVL-MopeyMule-8B](https://huggingface.co/Casual-Autopsy/SOVL-MopeyMule-8B)
	    * [Casual-Autopsy/MopeyMule-Blackroot-8B](https://huggingface.co/Casual-Autopsy/MopeyMule-Blackroot-8B)
	  * Casual-Autopsy/Umbral-v3-2 + [ResplendentAI/Smarts_Llama3](https://huggingface.co/ResplendentAI/Smarts_Llama3)
	    * [Hastagaras/Jamet-8B-L3-MK.V-Blackroot](https://huggingface.co/Hastagaras/Jamet-8B-L3-MK.V-Blackroot)
	    * [Casual-Autopsy/SOVL-MopeyMule-8B](https://huggingface.co/Casual-Autopsy/SOVL-MopeyMule-8B)
	    * [Casual-Autopsy/MopeyMule-Blackroot-8B](https://huggingface.co/Casual-Autopsy/MopeyMule-Blackroot-8B)
	  * Casual-Autopsy/Umbral-v3-3 + [ResplendentAI/RP_Format_QuoteAsterisk_Llama3](https://huggingface.co/ResplendentAI/RP_Format_QuoteAsterisk_Llama3)
	    * [grimjim/Llama-3-Oasis-v1-OAS-8B](https://huggingface.co/grimjim/Llama-3-Oasis-v1-OAS-8B)
	    * [Casual-Autopsy/SOVL-MopeyMule-8B](https://huggingface.co/Casual-Autopsy/SOVL-MopeyMule-8B)
	    * [Casual-Autopsy/MopeyMule-Blackroot-8B](https://huggingface.co/Casual-Autopsy/MopeyMule-Blackroot-8B)

	## Secret Sauce

	The following YAML configurations were used to produce this model:

	### Umbral-v3-1

	```yaml
	slices:
	- sources:
	  - model: Sao10K/L3-8B-Stheno-v3.2
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.65
	  - model: Casual-Autopsy/SOVL-MopeyMule-8B
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.25
	  - model: Casual-Autopsy/MopeyMule-Blackroot-8B
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.1
	merge_method: task_arithmetic
	base_model: Sao10K/L3-8B-Stheno-v3.2
	normalize: False
	dtype: bfloat16
	```

	### Umbral-v3-2

	```yaml
	slices:
	- sources:
	  - model: Hastagaras/Jamet-8B-L3-MK.V-Blackroot
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.75
	  - model: Casual-Autopsy/SOVL-MopeyMule-8B
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.15
	  - model: Casual-Autopsy/MopeyMule-Blackroot-8B
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.1
	merge_method: task_arithmetic
	base_model: Hastagaras/Jamet-8B-L3-MK.V-Blackroot
	normalize: False
	dtype: bfloat16
	```

	### Umbral-v3-3

	```yaml
	slices:
	- sources:
	  - model: grimjim/Llama-3-Oasis-v1-OAS-8B
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.55
	  - model: Casual-Autopsy/SOVL-MopeyMule-8B
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.35
	  - model: Casual-Autopsy/MopeyMule-Blackroot-8B
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.1
	merge_method: task_arithmetic
	base_model: grimjim/Llama-3-Oasis-v1-OAS-8B
	normalize: False
	dtype: bfloat16
	```

	### L3-Umbral-Mind-RP-v0.3-8B

	```yaml
	models:
	- model: Casual-Autopsy/Umbral-v3-1+ResplendentAI/Theory_of_Mind_Llama3
	- model: Casual-Autopsy/Umbral-v3-2+ResplendentAI/Smarts_Llama3
	- model: Casual-Autopsy/Umbral-v3-3+ResplendentAI/RP_Format_QuoteAsterisk_Llama3
	merge_method: model_stock
	base_model: Casual-Autopsy/Umbral-v3-1
	dtype: bfloat16
	```
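For readers unfamiliar with Model Stock, the sketch below shows a simplified version of the interpolation described in the Model Stock paper: with k fine-tuned models whose deltas from the base form an angle theta, the ratio is t = k * cos(theta) / (1 + (k - 1) * cos(theta)), and the result is t * average + (1 - t) * base. This is an assumption-laden toy, not mergekit's code: the function names are hypothetical, the vectors are made up, the angle is taken as the mean pairwise cosine, and a real merge works per tensor.

```python
import math
from itertools import combinations
from typing import List

Vector = List[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity, returning 0.0 if either vector has zero length."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def model_stock(base: Vector, models: List[Vector]) -> Vector:
    """Interpolate between the base and the average of the fine-tuned models."""
    k = len(models)
    if k < 2:
        raise ValueError("need at least two fine-tuned models")
    # Deltas of each fine-tuned model relative to the base.
    deltas = [[m - b for m, b in zip(model, base)] for model in models]
    pairs = list(combinations(deltas, 2))
    cos_theta = sum(cosine(a, b) for a, b in pairs) / len(pairs)
    t = k * cos_theta / (1 + (k - 1) * cos_theta)
    average = [sum(vals) / k for vals in zip(*models)]
    return [t * a + (1 - t) * b for a, b in zip(average, base)]


base = [0.0, 0.0]
print(model_stock(base, [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]))
```

Two limiting cases give some intuition: when the deltas all point the same way, t approaches 1 and the result is the plain average, and when they are orthogonal, t is 0 and the result falls back to the base.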

	### L3-Umbral-Mind-RP-v1.0-8B

	```yaml
	slices:
	- sources:
	  - model: Casual-Autopsy/L3-Umbral-Mind-RP-v0.3-8B
	    layer_range: [0, 32]
	  - model: Casual-Autopsy/Llama-3-Mopeyfied-Psychology-8B
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.14
	  - model: Sao10K/L3-8B-Stheno-v3.2
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.03
	  - model: Hastagaras/Halu-8B-Llama3-Blackroot
	    layer_range: [0, 32]
	    parameters:
	      weight: 0.03
	merge_method: task_arithmetic
	base_model: Casual-Autopsy/L3-Umbral-Mind-RP-v0.3-8B
	dtype: bfloat16
	```