Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Sep 26, 2024 • 56
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 Text Generation • Updated Aug 7, 2024 • 4.63k • 22
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4 Text Generation • Updated Aug 7, 2024 • 255k • 96