Ba2han
/

qwen-coder-thinker-q4_k_m

Inference Endpoints

Model card Files Files and versions Community

qwen-coder-thinker-q4_k_m / README.md

Ba2han's picture

Update README.md

41c1b9d verified 11 days ago

|

history blame contribute delete

382 Bytes

metadata

language:
  - en
base_model:
  - deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Extracted a 64 Rank Lora from DeepSeek-R1-Distill-Qwen-32B
Merged & Quantized into Q4_K_M

Note: The model seems to be somewhat working with the R1's weird template too but it repeats random Chinese characters and the quality seems to be consistently worse.

Maybe try using the R1 tokenizer.