---
language:
- en
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
---
- Extracted a rank-64 LoRA from DeepSeek-R1-Distill-Qwen-32B
- Merged & quantized to Q4_K_M (see the sketch below)
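
A minimal sketch of the merge-and-save step, assuming a PEFT-format adapter at a hypothetical local path (`./extracted-lora-r64`) and a follow-up GGUF conversion/quantization with llama.cpp. The paths, base-model choice, and commands below are illustrative assumptions, not the exact pipeline used for this release.

```python
# Sketch: fold an extracted LoRA adapter back into a base model, save it,
# then quantize with llama.cpp. Paths here are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE = "deepseek-ai/DeepSeek-R1-Distill-Qwen-32B"  # base used here for illustration
ADAPTER = "./extracted-lora-r64"                   # hypothetical path to the rank-64 adapter

base = AutoModelForCausalLM.from_pretrained(BASE, torch_dtype=torch.bfloat16)
model = PeftModel.from_pretrained(base, ADAPTER)

# Merge the LoRA weights into the base weights and drop the adapter wrappers.
merged = model.merge_and_unload()
merged.save_pretrained("./merged-model")
AutoTokenizer.from_pretrained(BASE).save_pretrained("./merged-model")

# Quantization to Q4_K_M is then typically done with llama.cpp, e.g.:
#   python convert_hf_to_gguf.py ./merged-model --outfile merged-f16.gguf --outtype f16
#   ./llama-quantize merged-f16.gguf merged-Q4_K_M.gguf Q4_K_M
```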
### Note: The model seems to work reasonably with R1's unusual chat template as well, but it then repeats random Chinese characters and the output quality is consistently worse.
### If that happens, try using the R1 tokenizer instead; a sketch follows below.
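
A minimal sketch of that workaround: build the prompt with the R1-Distill tokenizer's own chat template instead of the template embedded in the GGUF, then run the quantized model. The GGUF filename and the use of `llama-cpp-python` are assumptions for illustration.

```python
# Sketch: format the prompt with the R1-Distill tokenizer's chat template,
# then feed the raw string to the Q4_K_M GGUF. Paths are placeholders.
from transformers import AutoTokenizer
from llama_cpp import Llama

tok = AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-R1-Distill-Qwen-32B")
prompt = tok.apply_chat_template(
    [{"role": "user", "content": "Explain LoRA extraction in one paragraph."}],
    tokenize=False,
    add_generation_prompt=True,
)

llm = Llama(model_path="./merged-Q4_K_M.gguf", n_ctx=4096)  # placeholder path
out = llm(prompt, max_tokens=512)
print(out["choices"][0]["text"])
```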