---
license: llama3
---

This is an experimental 2x8B MoE with random gates, built from the following two models:
- Hermes-2-Theta-Llama-3-8B by Nous Research: https://huggingface.co/NousResearch/Hermes-2-Theta-Llama-3-8B
- llama-3-cat-8b-instruct-v1 by TheSkullery: https://huggingface.co/TheSkullery/llama-3-cat-8b-instruct-v1

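A merge like this can be assembled with mergekit's `mergekit-moe` tool, whose `gate_mode: random` option initializes the routers with random weights. The YAML below is a sketch of such a config, not the exact one used for this model (which was not published):

```yaml
# Illustrative mergekit-moe config for a 2x8B MoE with random gates.
# The actual settings used for this merge are an assumption.
base_model: TheSkullery/llama-3-cat-8b-instruct-v1
gate_mode: random      # random router weights, as described above
dtype: bfloat16
experts:
  - source_model: NousResearch/Hermes-2-Theta-Llama-3-8B
  - source_model: TheSkullery/llama-3-cat-8b-instruct-v1
```

With `gate_mode: random` no `positive_prompts` are needed, since the routers are not calibrated against any prompt set.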
***Important***

Make sure to add `</s>` as a stop sequence, since llama-3-cat-8b-instruct-v1 is used as the base model.
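If your frontend does not support stop sequences, the same effect can be had by truncating the decoded completion client-side. A minimal sketch (the helper name `apply_stop` is ours, not part of any library API):

```python
def apply_stop(text: str, stops=("</s>",)) -> str:
    """Truncate a decoded completion at the first stop sequence,
    mirroring what a serving frontend does when '</s>' is configured
    as a stop string."""
    for stop in stops:
        idx = text.find(stop)
        if idx != -1:
            text = text[:idx]
    return text

print(apply_stop("Sure, here is the answer.</s> stray tokens"))
# -> Sure, here is the answer.
```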