QwQ-32B-Q4_K_M Cyclically thinking
#6
by
xstongxue2024
- opened
I tried deploying QwQ-32B-Q4_K_M with Ollama, but it keeps generating an endless loop of self-asked and self-answered responses. Has anyone encountered the same issue?
Here is the configuration I used to import the model:
FROM ./QwQ-32B-Q4_K_M.gguf
PARAMETER temperature 0.6
PARAMETER min_p 0.00
PARAMETER repeat_penalty 1.0
PARAMETER top_k 40
PARAMETER top_p 0.95
PARAMETER num_predict 32768
STOP "<|im_start|>"
STOP "<|im_end|>"
xstongxue2024
changed discussion status to
closed
See our guide for solving the issue: https://docs.unsloth.ai/basics/tutorial-how-to-run-qwq-32b-effectively