QwQ-32B-Q4_K_M Cyclically thinking

by xstongxue2024 - opened about 16 hours ago

xstongxue2024

about 16 hours ago

I tried deploying QwQ-32B-Q4_K_M with Ollama, but it keeps generating an endless loop of self-asked and self-answered responses. Has anyone encountered the same issue?

Here is the configuration I used to import the model:

FROM ./QwQ-32B-Q4_K_M.gguf
PARAMETER temperature 0.6
PARAMETER min_p 0.00
PARAMETER repeat_penalty 1.0
PARAMETER top_k 40
PARAMETER top_p 0.95
PARAMETER num_predict 32768
STOP "<|im_start|>"
STOP "<|im_end|>"

xstongxue2024 changed discussion status to closed about 15 hours ago

shimmyshimmer

Unsloth AI org about 13 hours ago

See our guide for solving the issue: https://docs.unsloth.ai/basics/tutorial-how-to-run-qwq-32b-effectively

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment