Chroma-GGUF / README.md
---
license: apache-2.0
base_model:
  - lodestones/Chroma
---

Chroma is an 8.9-billion-parameter rectified flow transformer capable of generating images from text descriptions. It is based on FLUX.1 [schnell], with heavy architectural modifications.

The weights are quantized into GGUF format using a modified llama.cpp and the tools from city96's ComfyUI-GGUF (ComfyUI-GGUF/tools). Distillation layers are not quantized.

Q*_M GGUFs are mixed quantizations that aim to maximize speed by selecting the quantization type of certain layers individually.

  • Q8_M primarily uses Q8_0 quantization of weights for performance, mixed with Q6_K on less heavy layers.
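
To illustrate the idea of a mixed quantization, here is a minimal Python sketch of a per-tensor plan. It is not the script used to produce these files. The function names, the `"distill"` name match, the parameter-count threshold for a "heavy" layer, and the choice of F16 for unquantized tensors are all assumptions made for illustration. The bits-per-weight figures follow from the GGUF block layouts: Q8_0 stores 32 weights in 34 bytes, and Q6_K stores 256 weights in 210 bytes.

```python
# Hypothetical sketch of a mixed-quantization plan; NOT the actual tooling
# used for these GGUFs. Names, threshold, and matching rule are assumptions.

# Effective storage cost per weight for each type.
# Q8_0: 34 bytes per 32-weight block; Q6_K: 210 bytes per 256-weight block.
BITS_PER_WEIGHT = {"F16": 16.0, "Q8_0": 8.5, "Q6_K": 6.5625}


def choose_quant(name: str, n_params: int, heavy_threshold: int = 10_000_000) -> str:
    """Pick a quantization type for one tensor.

    Assumptions: distillation layers can be recognized by "distill" in the
    tensor name and are kept in F16; "heavy" means at least
    `heavy_threshold` parameters.
    """
    if "distill" in name:
        return "F16"   # left unquantized
    if n_params >= heavy_threshold:
        return "Q8_0"  # heavy layers
    return "Q6_K"      # less heavy layers


def make_plan(tensors: dict[str, int]) -> dict[str, str]:
    """Map each tensor name to its chosen quantization type."""
    return {name: choose_quant(name, n) for name, n in tensors.items()}


def estimated_bytes(tensors: dict[str, int], plan: dict[str, str]) -> float:
    """Estimate the stored size of all tensors under the given plan."""
    return sum(n * BITS_PER_WEIGHT[plan[name]] / 8 for name, n in tensors.items())


if __name__ == "__main__":
    # Made-up tensor names and sizes, purely for demonstration.
    tensors = {
        "example_distill_layer.weight": 1_000_000,
        "example_big_block.weight": 28_000_000,
        "example_small_proj.weight": 200_000,
    }
    plan = make_plan(tensors)
    for name, qtype in plan.items():
        print(f"{name}: {qtype}")
    print(f"estimated size: {estimated_bytes(tensors, plan) / 1e6:.1f} MB")
```

A real conversion would read the tensor names and shapes from the source checkpoint and would likely use a more careful rule than a single size threshold; the sketch only shows how a per-layer choice translates into a plan and a size estimate.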