gelukuMLG committed
Commit 8bd52c7
1 Parent(s): 5f9c991

Update README.md

Files changed (1): README.md (+22, -1)
README.md CHANGED
@@ -11,4 +11,25 @@ This is an experimental 2x8B moe with random gates, using the following 2 models

  ***Important***

- Make sure to add `</s>` a stop sequence as it uses llama-3-cat-8B-instruct-V1 as the base model.
+ Make sure to add `</s>` as a stop sequence, as it uses llama-3-cat-8B-instruct-V1 as the base model.
+
+ Update:
+
+ Due to requests, I decided to add the rest of the quants. Enjoy!
+
+
+ Mergekit recipe of the model, if you're too lazy to check the files:
+
+ ```
+ base_model: TheSkullery/llama-3-cat-8b-instruct-v1
+ gate_mode: random
+ dtype: bfloat16
+ experts_per_token: 2
+ experts:
+   - source_model: TheSkullery/llama-3-cat-8b-instruct-v1
+     positive_prompts:
+       - " "
+   - source_model: NousResearch/Hermes-2-Theta-Llama-3-8B
+     positive_prompts:
+       - " "
+ ```
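
For context, a recipe like the one above is what mergekit's MoE script consumes; building the merge would look roughly like `mergekit-moe recipe.yml ./output-model` with a recent mergekit install (the file names here are placeholders, not files from this repo). With `gate_mode: random` the router weights are initialized randomly rather than derived from prompts, which is presumably why the `positive_prompts` entries are just blank strings.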
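Since the updated README stresses adding `</s>` as a stop sequence, here is a minimal sketch of doing that with llama-cpp-python; the GGUF file name is a placeholder, not an actual file from this repo:

```python
from llama_cpp import Llama

# Load one of the GGUF quants (the file name below is a placeholder).
llm = Llama(model_path="./llama-3-2x8b-moe.Q4_K_M.gguf", n_ctx=8192)

# Pass `</s>` in `stop` so generation halts where the
# llama-3-cat-8B-instruct-V1 base expects the turn to end.
out = llm(
    "Explain what a random-gate MoE merge is.",
    max_tokens=256,
    stop=["</s>"],
)
print(out["choices"][0]["text"])
```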