|
--- |
|
pipeline_tag: text-generation |
|
tags: |
|
- nlp |
|
- code |
|
- llama-cpp |
|
- exllama |
|
- gguf |
|
- phi-4 |
|
- phi |
|
- microsoft |
|
- gguf |
|
- code |
|
- math |
|
- chat |
|
- chat |
|
- conversational |
|
- roleplay |
|
- text-generation |
|
- safetensors |
|
library_name: transformers |
|
base_model: microsoft/phi-4 |
|
--- |
|
# phi-4-Q3_K_S-GGUF |
|
|
|
**Original Model:** `/microsoft/phi-4` |
|
**Quantization Method:** `GGUF` |
|
|
|
## Overview |
|
This is an GGUF Q5 KM quantized version of [phi-4](https://huggingface.co/microsoft/phi-4). |
|
|
|
## Quantization By |
|
I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. |
|
|
|
I hope the community finds these quantizations useful. |
|
|
|
Andrew Webby @ [RolePlai](https://roleplai.app/) |
|
|
|
|