README.md · roleplaiapp/phi-4-Q5_K

phi-4-Q5_K_M-GGUF / README.md

Update readme

313ee2d verified 2 days ago

706 Bytes

	---
	pipeline_tag: text-generation
	tags:
	- nlp
	- code
	- llama-cpp
	- exllama
	- gguf
	- phi-4
	- phi
	- microsoft
	- gguf
	- code
	- math
	- chat
	- chat
	- conversational
	- roleplay
	- text-generation
	- safetensors
	library_name: transformers
	base_model: microsoft/phi-4
	---
	# phi-4-Q3_K_S-GGUF

	Original Model: `/microsoft/phi-4`
	Quantization Method: `GGUF`

	## Overview
	This is an GGUF Q5 KM quantized version of [phi-4](https://huggingface.co/microsoft/phi-4).

	## Quantization By
	I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models.

	I hope the community finds these quantizations useful.

	Andrew Webby @ [RolePlai](https://roleplai.app/)