Add support for EXL2 4-bit KV cache; switch from metric gigabytes (1e9 bytes) to JEDEC gigabytes (2^30 bytes) (#2) 107ba5f verified NyxKrage mo137 committed on Mar 16, 2024
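
For reference, the unit change and the effect of a 4-bit cache can be sketched as follows. This is a minimal illustration, not the code from this commit: the function names, the model shape (32 layers, 8 KV heads, head dimension 128, 8192-token context), and the cache-size formula are assumptions chosen for the example, and the formula ignores any per-group scale or metadata overhead a real quantized cache may carry.

```python
METRIC_GB = 10**9   # metric gigabyte: 1e9 bytes
JEDEC_GB = 2**30    # JEDEC gigabyte: 2^30 bytes (1,073,741,824)


def bytes_to_metric_gb(n_bytes: int) -> float:
    """Convert a byte count to metric gigabytes."""
    return n_bytes / METRIC_GB


def bytes_to_jedec_gb(n_bytes: int) -> float:
    """Convert a byte count to JEDEC gigabytes."""
    return n_bytes / JEDEC_GB


def kv_cache_bytes(num_layers: int, num_kv_heads: int, head_dim: int,
                   context_len: int, bits_per_element: int) -> int:
    """Estimate KV cache size in bytes (hypothetical helper).

    Assumes one key and one value tensor per layer, each holding
    num_kv_heads * head_dim elements per token, with no quantization
    overhead (scales, zero points) counted.
    """
    elements = 2 * num_layers * num_kv_heads * head_dim * context_len
    return elements * bits_per_element // 8


if __name__ == "__main__":
    # Hypothetical model shape, chosen only to make the arithmetic easy to check.
    fp16 = kv_cache_bytes(32, 8, 128, 8192, bits_per_element=16)
    q4 = kv_cache_bytes(32, 8, 128, 8192, bits_per_element=4)

    print(f"{bytes_to_metric_gb(fp16):.3f}")  # 1.074 (metric GB)
    print(f"{bytes_to_jedec_gb(fp16):.3f}")   # 1.000 (JEDEC GB)
    print(f"{bytes_to_jedec_gb(q4):.3f}")     # 0.250 (JEDEC GB)
```

The same byte count reads about 7% smaller in JEDEC gigabytes than in metric gigabytes, and under these assumptions a 4-bit cache takes a quarter of the space of a 16-bit one.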