This model was converted to GGUF format from [`flammenai/Mahou-1.5-mistral-nemo-12B`](https://huggingface.co/flammenai/Mahou-1.5-mistral-nemo-12B) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
Refer to the [original model card](https://huggingface.co/flammenai/Mahou-1.5-mistral-nemo-12B) for more details on the model.
---

## Model details

**Mahou-1.5-mistral-nemo-12B**

Mahou is designed to provide short messages in a conversational context. It is capable of casual conversation and character roleplay.

### Chat Format

This model has been trained to use ChatML format.

```
<|im_start|>system
{{system}}<|im_end|>
<|im_start|>{{char}}
{{message}}<|im_end|>
<|im_start|>{{user}}
{{message}}<|im_end|>
```

### Roleplay Format

- Speech without quotes.
- Actions in *asterisks*

```
*leans against wall cooly* so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.
```

### SillyTavern Settings

- Use ChatML for the Context Template.
- Enable Instruct Mode.
- Use the Mahou ChatML Instruct preset.
- Use the Mahou Sampler preset.

### Method

ORPO finetuned with 4x H100 for 3 epochs.

### Open LLM Leaderboard Evaluation Results

Detailed results can be found here.

| Metric              | Value |
|---------------------|-------|
| Avg.                | 26.28 |
| IFEval (0-Shot)     | 67.51 |
| BBH (3-Shot)        | 36.26 |
| MATH Lvl 5 (4-Shot) | 5.06  |
| GPQA (0-shot)       | 3.47  |
| MuSR (0-shot)       | 16.47 |
| MMLU-PRO (5-shot)   | 28.91 |

---
## Use with llama.cpp
Install llama.cpp through brew (works on Mac and Linux)
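For example, following the usual GGUF-my-repo instructions — the repo and quant filenames below are placeholders, so substitute the GGUF file actually published in this repo:

```shell
# Install llama.cpp (macOS and Linux)
brew install llama.cpp

# Run inference directly from a Hugging Face repo.
# <repo-id> and <quant-file> are placeholders for this repo's actual names.
llama-cli --hf-repo <repo-id> --hf-file <quant-file>.gguf \
  -p "The meaning to life and the universe is"
```

The `llama-server` binary accepts the same `--hf-repo`/`--hf-file` flags if you prefer an HTTP endpoint.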