Still not ok with new llama-cpp version and llama.bin files
#5 · by Alwmd · opened
I tried all of the Q4 .bin files and always get the magic number error.
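For anyone else hitting this: the magic number error means llama.cpp no longer recognises the file header. A quick way to check what format a model file actually is, is to read its first four bytes. This is a sketch; the legacy GGML magic bytes listed below are my assumption from the little-endian on-disk encoding, not something stated in this thread:

```python
# Candidate magic bytes for llama.cpp model containers.
# GGUF files literally start with ASCII "GGUF"; the legacy GGML/GGJT
# entries are assumptions based on little-endian byte order.
MAGICS = {
    b"GGUF": "GGUF (current format)",
    b"lmgg": "GGML (legacy, unversioned)",
    b"tjgg": "GGJT (legacy GGML v1-v3)",
}

def detect_format(path: str) -> str:
    """Guess the model-file format from its first 4 bytes."""
    with open(path, "rb") as f:
        magic = f.read(4)
    return MAGICS.get(magic, f"unknown magic {magic!r}")
```

If `detect_format` reports anything other than GGUF, a current llama.cpp build will refuse to load the file.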
Sorry for the long delay with this - I have finally uploaded GGUF models for this! Please use those instead; GGML is dead.
So the GGML versions are dead; maybe you could remove them from your account or something? I was getting errors because of them, and only just now got the GGUF version working.
Have you had any positive experience running it on GPU with llama-cpp-python?
I haven't tried llama-cpp-python.
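For anyone else landing on this thread: GPU offload in llama-cpp-python is normally controlled by the `n_gpu_layers` parameter when constructing `Llama`. A minimal sketch follows; the model filename and the build flag in the comment are assumptions for illustration, not from this thread, and the import is guarded so the snippet still parses without the library installed:

```python
# Sketch only: assumes llama-cpp-python was installed with GPU support, e.g.
#   CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python
try:
    from llama_cpp import Llama
except ImportError:
    Llama = None  # library not installed; treat this as a sketch

def load_model(model_path: str):
    """Load a GGUF model with all layers offloaded to the GPU."""
    if Llama is None:
        raise RuntimeError("llama-cpp-python is not installed")
    return Llama(
        model_path=model_path,  # e.g. a Q4_K_M .gguf file (hypothetical name)
        n_gpu_layers=-1,        # -1 offloads every layer to the GPU
        n_ctx=2048,             # context window size
    )
```

If the layers are actually being offloaded, llama.cpp prints the number of layers placed on the GPU at load time, which is a quick way to confirm the build has GPU support.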
Just use https://huggingface.co/TheBloke/Llama-2-7B-GGUF and the other GGUF files. Yes, I might remove these GGML models at some point, but there is already a clear note at the top of the repo indicating they are deprecated.