---
pipeline_tag: text-generation
tags:
- nlp
- code
- llama-cpp
- exllama
- gguf
- phi-4
- phi
- microsoft
- math
- chat
- conversational
- roleplay
- text-generation
- safetensors
library_name: transformers
base_model: microsoft/phi-4
---

# phi-4-Q3_K_S-GGUF

**Original Model:** `microsoft/phi-4`

**Quantization Method:** `GGUF`

## Overview

This is a GGUF Q3_K_S quantized version of [phi-4](https://huggingface.co/microsoft/phi-4).

## Quantization By

I often have idle A100 GPUs while building, testing, and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

Andrew Webby @ [RolePlai](https://roleplai.app/)
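
Since this repository ships a GGUF file, it can be handy to sanity-check a download before loading it into a runtime. The following is a minimal sketch, not part of the original release: it reads the fixed-size GGUF header (the `GGUF` magic bytes, a 32-bit version, then 64-bit tensor and metadata counts) using only the Python standard library. It assumes a little-endian file with GGUF version 2 or later, and the function name `read_gguf_header` and the file name in the usage note are hypothetical.

```python
import struct

GGUF_MAGIC = b"GGUF"
HEADER_SIZE = 24  # 4-byte magic + uint32 version + two uint64 counts


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) from a GGUF file.

    Assumes a little-endian file using GGUF version 2 or later, where both
    counts are stored as unsigned 64-bit integers.
    """
    with open(path, "rb") as f:
        header = f.read(HEADER_SIZE)

    # A truncated file or a wrong magic means this is not a usable GGUF file.
    if len(header) < HEADER_SIZE or header[:4] != GGUF_MAGIC:
        raise ValueError(f"{path} does not look like a GGUF file")

    # "<IQQ": little-endian uint32 followed by two uint64 values (20 bytes).
    version, tensor_count, kv_count = struct.unpack("<IQQ", header[4:HEADER_SIZE])
    return version, tensor_count, kv_count
```

For example, `read_gguf_header("phi-4-Q3_K_S.gguf")` (a placeholder file name; use whatever the downloaded file is actually called) returns a tuple of the format version and the two counts, or raises `ValueError` if the download is truncated or is not a GGUF file.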