---
license: llama3
language:
- de
- en
library_name: transformers
---
# Llama3_DiscoLeo_Instruct_8B_v0.1_4bit_awq_glc
This model is a 4-bit quantization of [DiscoResearch/Llama3_DiscoLeo_Instruct_8B_v0.1](https://huggingface.co/DiscoResearch/Llama3_DiscoLeo_Instruct_8B_v0.1),
created using [AutoAWQ](https://github.com/casper-hansen/AutoAWQ) with a custom bilingual calibration dataset and `quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}`.
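
For readers who want to reproduce a similar quantization, the following is a minimal sketch of how the config above could be passed to AutoAWQ. It is not the exact script used for this model: the function name `quantize_model`, the output path, and the `calib_texts` argument are assumptions for illustration, and the bilingual calibration dataset itself is not included in this card, so you would need to supply your own German and English text samples.

```python
# Quantization settings as stated in this model card.
QUANT_CONFIG = {
    "zero_point": True,    # asymmetric quantization with a per-group zero point
    "q_group_size": 128,   # weights share a scale/zero point in groups of 128
    "w_bit": 4,            # 4-bit weights
    "version": "GEMM",     # AutoAWQ kernel variant
}

BASE_MODEL = "DiscoResearch/Llama3_DiscoLeo_Instruct_8B_v0.1"


def quantize_model(calib_texts, quant_path="./discoleo-awq-4bit", model_path=BASE_MODEL):
    """Quantize `model_path` with AutoAWQ and save the result to `quant_path`.

    `calib_texts` is a list of plain-text strings used for calibration.
    It stands in for the custom bilingual dataset, which is not published here.
    `quant_path` is a hypothetical local output directory.
    """
    # Imported lazily so the config can be inspected without AutoAWQ installed.
    from awq import AutoAWQForCausalLM
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_path)
    model = AutoAWQForCausalLM.from_pretrained(model_path)

    # Run AWQ calibration and quantize the weights in place.
    model.quantize(tokenizer, quant_config=QUANT_CONFIG, calib_data=calib_texts)

    # Write the quantized weights and the tokenizer side by side.
    model.save_quantized(quant_path)
    tokenizer.save_pretrained(quant_path)
    return quant_path
```

Calling `quantize_model(my_texts)` downloads the full-precision base model and needs a GPU with enough memory to hold it, so the call is left out of the sketch itself.
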
Copy of original model card:
...