llama_model_quantize: failed to quantize:
#223 opened by biu12
When I tried to quantize my fine-tuned model (Llama 3 8B), I ran into this problem.
command:
sudo ./llama-quantize /root/hg_to_gguf.gguf/Llama_Lora_Merge-8.0B-F16.gguf /root/quantize_model q4_0
problem:
llama_model_quantize: failed to quantize: basic_ios::clear: iostream error
main: failed to quantize model from '/root/hg_to_gguf.gguf/Llama_Lora_Merge-8.0B-F16.gguf'
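For context, "basic_ios::clear: iostream error" generally means a file read or write on one of the two paths failed (for example a truncated input file, an output path that cannot be written, or a full disk). The checks below are just a diagnostic sketch using the paths from the command above, not a confirmed fix:

# Does the input GGUF exist and look complete? An 8B F16 model should be roughly 16 GB.
ls -lh /root/hg_to_gguf.gguf/Llama_Lora_Merge-8.0B-F16.gguf

# Is the output path writable, and not an existing directory?
stat /root/quantize_model

# Is there enough free space for the quantized output (a few GB for q4_0)?
df -h /root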