How to run inference with llama.cpp
#1 by anse10rville - opened
If I just run `./main -m path/to/this` in llama.cpp, it raises the error `GGML_ASSERT: examples/main/main.cpp:248: llama_add_eos_token(model) != 1`. My local llama.cpp works fine with other GGUFs, e.g. Qwen/Qwen1.5-7B-Chat-GGUF. Could you add some notes, please?
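For reference, here is a minimal sketch of the invocation I'm attempting; the model path and the prompt are placeholders, not the actual values:

```sh
# Minimal sketch: path/to/model.gguf is a placeholder for the downloaded GGUF.
# -m: path to the GGUF model file
# -p: prompt text
# -n: number of tokens to generate
./main -m path/to/model.gguf -p "Hello, how are you?" -n 128
```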
Thanks!