bhavyaaiplanet committed
Update README.md
README.md
CHANGED
@@ -29,13 +29,11 @@ effi 7b AWQ is a quantized version of effi 7b, which is a 7 billion parameter model
### Quantization Configuration

-
-
-
-
-
- "modules_to_not_convert": null
-
+ - zero_point: true,
+ - q_group_size: 128,
+ - w_bit: 4,
+ - version: "GEMM",
+ - modules_to_not_convert: null
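The bullets added above are the standard AutoAWQ `quant_config` fields. As a minimal sketch of how a checkpoint with this configuration is typically produced with AutoAWQ 0.1.8 (not part of this commit; the base-model ID and output directory are assumptions):

```python
# Sketch, not from this commit: apply the quantization configuration listed above with AutoAWQ.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

base_model = "aiplanet/effi-7b"   # assumed base checkpoint
quant_dir = "effi-7b-awq"         # placeholder output directory

quant_config = {
    "zero_point": True,
    "q_group_size": 128,
    "w_bit": 4,
    "version": "GEMM",
    # "modules_to_not_convert": None  # default value; listed in the README for completeness
}

# Load the fp16 model and tokenizer, run AWQ calibration/quantization, then save the 4-bit weights.
model = AutoAWQForCausalLM.from_pretrained(base_model)
tokenizer = AutoTokenizer.from_pretrained(base_model, trust_remote_code=True)
model.quantize(tokenizer, quant_config=quant_config)
model.save_quantized(quant_dir)
tokenizer.save_pretrained(quant_dir)
```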
@@ -79,4 +77,16 @@ print(f"{tokenizer.batch_decode(outputs.detach().cpu().numpy(), skip_special_tok
### Framework versions
- Transformers 4.37.2
- - Autoawq 0.1.8
+ - Autoawq 0.1.8
+
+ ### Citation
+
+ ```
+ @misc {bhavyaaiplanet,
+   author = { {Bhavya Bhola} },
+   title = { Quantized version of effi-7bb by AI Planet},
+   year = 2024,
+   url = { https://huggingface.co/aiplanet/effi-7b-awq },
+   publisher = { Hugging Face }
+ }
+ ```
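For reference, the framework versions listed above (Transformers 4.37.2, Autoawq 0.1.8) mean the quantized checkpoint can be loaded directly through `transformers`, which has shipped native AWQ support since 4.35. A small usage sketch in the spirit of the README's truncated `print(...)` line in the second hunk header; the prompt and generation settings are illustrative assumptions, not taken from the commit:

```python
# Sketch, not from this commit: load the AWQ checkpoint and reproduce the decode/print pattern
# shown in the hunk header above. Prompt and generation settings are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "aiplanet/effi-7b-awq"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

inputs = tokenizer("Explain AWQ quantization in one sentence.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)

print(f"{tokenizer.batch_decode(outputs.detach().cpu().numpy(), skip_special_tokens=True)[0]}")
```

Running this sketch requires the `autoawq` and `accelerate` packages alongside `transformers`.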