Llama.cpp Support
#1 by isr431 · opened
Minitron 4B support was just added to llama.cpp. Do the GGUF quants have to be remade?
We've merged the PR for these quants, so everything should work out of the box for you now. We'll update the README soon to reflect that.
lucyknada changed discussion status to closed