flammenai
/

Mahou-1.3b-llama3-8B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Mahou-1.3b-llama3-8B / README.md

nbeerbower's picture

Update README.md

e0412c4 verified 7 months ago

|

history blame contribute delete

1.26 kB

	---
	library_name: transformers
	license: llama3
	base_model:
	- nbeerbower/llama-3-SNAMD-8B
	datasets:
	- flammenai/MahouMix-v1
	- flammenai/FlameMix-DPO-v1
	---
	![image/png](https://huggingface.co/flammenai/Mahou-1.0-mistral-7B/resolve/main/mahou1.png)

	# Mahou-1.3b-llama3-8B

	Mahou is designed to provide short messages in a conversational context. It is capable of casual conversation and character roleplay.

	### Chat Format

	This model has been trained to use ChatML format.

	```
	<\|im_start\|>system
	{{system}}<\|im_end\|>
	<\|im_start\|>{{char}}
	{{message}}<\|im_end\|>
	<\|im_start\|>{{user}}
	{{message}}<\|im_end\|>
	```

	### Roleplay Format

	- Speech without quotes.
	- Actions in `asterisks`

	```
	leans against wall cooly so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.
	```

	### SillyTavern Settings

	1. Use ChatML for the Context Template.
	2. Enable Instruct Mode.
	3. Use the [Mahou preset](https://huggingface.co/datasets/flammenai/Mahou-ST-ChatML-Instruct/raw/main/Mahou.json).
	4. Recommended Additonal stopping strings: `["\n", "<\|", "</"]`

	### Method

	ORPO finetuned on 2x RTX 4060 Ti for 1 epoch.

	[Fine-tune Llama 3 with ORPO](https://mlabonne.github.io/blog/posts/2024-04-19_Fine_tune_Llama_3_with_ORPO.html)