Add support for EXL2 4 bit KV cache; switch from metric gigabytes (1e9 bytes) to JEDEC gigabytes (2^30 bytes) c8c7129 verified mo137 commited on Mar 16, 2024