# Llama 2 Model with FP8 KV Cache Checkpoint for TensorRT-LLM
Generated using https://github.com/vllm-project/vllm/blob/main/examples/fp8/quantizer/quantize.py