Possible to convert to GGUF?
#1 by jackboot · opened
You think it would run on llama.cpp?
GGUF is not just a file format: using it implies running the model in llama.cpp, a framework which almost certainly does not support this model's architecture.
Did you try GPTQ as well? Or the only option is to load in 4 bit with bnb?
GPTQ is also very model-specific. I do not know of any out-of-the-box quantization solutions apart from BnB.
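For anyone landing here: a minimal sketch of the BnB route mentioned above, using the standard `transformers` + `bitsandbytes` 4-bit path. The model id is a placeholder (replace it with this repo's name), and `trust_remote_code=True` is assumed to be needed since the architecture is custom.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Placeholder id -- substitute the actual repo name of this model.
model_id = "org/model-name"

# NF4 4-bit quantization config; compute in bfloat16 for quality.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
    trust_remote_code=True,  # custom architectures ship their own modeling code
)
```

This quantizes on the fly at load time, so it needs the full-precision weights downloaded first; unlike GGUF or GPTQ there is no pre-quantized artifact to share.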