This is a converted model to GGUF from nvidia/Mistral-NeMo-Minitron-8B-Instruct
quantized to Q2_K
using llama.cpp
library.
- Downloads last month
- 1
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.
Model tree for Manel/Mistral-NeMo-Minitron-8B-Instruct-Q2_K-GGUF
Base model
nvidia/Mistral-NeMo-Minitron-8B-Base
Finetuned
nvidia/Mistral-NeMo-Minitron-8B-Instruct