Update README.md
Browse files
README.md
CHANGED
@@ -7,14 +7,14 @@ Available AWQ Quantisations:
|
|
7 |
|
8 |
| Revision | Model Size | Group Size | w_bit |
|
9 |
|--------------|------------|------------|-------|
|
10 |
-
| `main` |
|
11 |
-
| `gs_2_4bit` |
|
12 |
-
| `gs_4_4bit` |
|
13 |
-
| `gs_8_4bit` |
|
14 |
-
| `gs_16_4bit` |
|
15 |
-
| `gs_64_4bit` |
|
16 |
-
| `gs_128_4bit`|
|
17 |
-
| `gs_512_4bit`|
|
18 |
|
19 |
**This model was based on [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct)**
|
20 |
|
|
|
7 |
|
8 |
| Revision | Model Size | Group Size | w_bit |
|
9 |
|--------------|------------|------------|-------|
|
10 |
+
| `main` | ~6.13 GB | 32 | 4 |
|
11 |
+
| `gs_2_4bit` | ~x.xx GB | 2 | 4 |
|
12 |
+
| `gs_4_4bit` | ~x.xx GB | 4 | 4 |
|
13 |
+
| `gs_8_4bit` | ~x.xx GB | 8 | 4 |
|
14 |
+
| [`gs_16_4bit`](https://huggingface.co/hivata/functionary-small-v3.2-AWQ/tree/gs_16-4bit) | ~6.68 GB | 16 | 4 |
|
15 |
+
| `gs_64_4bit` | ~x.xx GB | 64 | 4 |
|
16 |
+
| `gs_128_4bit`| ~x.xx GB | 128 | 4 |
|
17 |
+
| `gs_512_4bit`| ~x.xx GB | 512 | 4 |
|
18 |
|
19 |
**This model was based on [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct)**
|
20 |
|