iMatrix GGUF quants of Tess-2.0-Mixtral-8x22B, a newer finetune of Mixtral-8x22B

EdgeQuants are still underway; the IQ4XS version is recommended. Make sure to merge the parts back together before using it:

cat tessIQ4XS.gguf.part* > tessIQ4XS.gguf
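
As a hedged sketch of the merge step above, the helper below concatenates the parts and then checks that the result starts with the 4-byte `GGUF` magic that every GGUF file begins with. `merge_parts` is a hypothetical name, not part of llama.cpp, and the sketch assumes the part suffixes sort correctly in plain lexical order (which is what the shell glob uses).

```shell
# Sketch only: merge split parts and sanity-check the output.
# merge_parts is a hypothetical helper, not a llama.cpp tool.
merge_parts() {
    out="$1"
    # The glob expands in lexical order, so this assumes the part
    # suffixes sort in the same order the file was split.
    cat "$out".part* > "$out" || return 1
    # GGUF files begin with the ASCII magic "GGUF".
    magic=$(head -c 4 "$out")
    if [ "$magic" = "GGUF" ]; then
        echo "ok: $out looks like a GGUF file"
    else
        echo "error: $out does not start with the GGUF magic" >&2
        return 1
    fi
}
```

For this repo it would be called as `merge_parts tessIQ4XS.gguf`. A passing check only shows the first part came first; it does not prove that every part was downloaded completely.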

Then use it with a llama.cpp version from April 12 or older. The April 13 release had massive changes and broke inference for MoE models.

Format: GGUF
Model size: 141B params
Architecture: llama

Model tree for nisten/Tess-Mixtral-8x22B-imatrix-gguf

Base model: migtissera/Tess-2.0-Mixtral-8x22B
Quantized (2): this model