This repository hosts quantized versions of the IBM Mistral-Small-24B-Instruct-2501 model.
Format: GGUFConverter: llama.cpp bfcce4d693617ec843d0b2510f6ee16e6bc6720dQuantizer: LM-Kit.NET 2024.2.1
For more detailed information on the base model, please visit the following link:
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit