
ggml_llava-v1.5-13b

This repo contains GGUF files for running llava-v1.5-13b inference end-to-end with llama.cpp, without any extra dependencies.

Note: The mmproj-model-f16.gguf file structure is experimental and may change. Always use the latest code in llama.cpp.
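Because the file structure may change, it can help to confirm that a downloaded file is at least a recognizable GGUF container before loading it. The following is a minimal sketch, not part of llama.cpp: the helper name `read_gguf_version` is hypothetical, and it assumes the standard GGUF header layout (the 4-byte magic `GGUF` followed by a little-endian 32-bit version number). It does not validate the tensor layout that llama.cpp expects for the projector file.

```python
import struct

# The first four bytes of a GGUF file are the ASCII magic "GGUF".
GGUF_MAGIC = b"GGUF"


def read_gguf_version(path):
    """Return the GGUF format version stored in the file header.

    Hypothetical helper for illustration. Assumes a little-endian file,
    which is the default GGUF byte order.
    Raises ValueError if the file does not start with the GGUF magic.
    """
    with open(path, "rb") as f:
        header = f.read(8)  # 4 bytes of magic + 4 bytes of version

    if len(header) < 8 or header[:4] != GGUF_MAGIC:
        raise ValueError(f"{path} does not look like a GGUF file")

    # Unpack the unsigned 32-bit version that follows the magic.
    (version,) = struct.unpack("<I", header[4:8])
    return version
```

For example, `read_gguf_version("mmproj-model-f16.gguf")` returns the container version as an integer, assuming the file sits in the current directory. A successful read only shows that the file is a GGUF container; compatibility with a given llama.cpp build still depends on using the latest code.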

GGUF
Model size: 322M params
Architecture: clip
Quantizations: 4-bit, 5-bit, 16-bit
