Edit model card
https://github.com/spydaz # LeroyDyer/Mixtral_AI_MultiToken-Q4_K_M-GGUF This model was converted to GGUF format from [`LeroyDyer/Mixtral_AI_MultiToken`](https://huggingface.co/LeroyDyer/Mixtral_AI_MultiToken) using llama.cpp. Refer to the [original model card](https://huggingface.co/LeroyDyer/Mixtral_AI_MultiToken) for more details on the model. ## Use with llama.cpp
brew install ggerganov/ggerganov/llama.cpp
llama-cli --hf-repo LeroyDyer/Mixtral_AI_MultiToken-Q4_K_M-GGUF --model mixtral_ai_multitoken.Q4_K_M.gguf -p "The meaning to life and the universe is "
llama-server --hf-repo LeroyDyer/Mixtral_AI_MultiToken-Q4_K_M-GGUF --model mixtral_ai_multitoken.Q4_K_M.gguf -c 2048
Downloads last month
609
GGUF
Model size
312M params
Architecture
clip

4-bit

Inference API
Inference API (serverless) does not yet support transformers models for this pipeline type.