Update README.md
README.md
CHANGED
@@ -3,14 +3,22 @@ license: llama3.1
 tags:
 - gguf
 - llama3
-- llama
 pipeline_tag: text-generation
+datasets:
+- froggeric/imatrix
+language:
+- en
+library_name: ggml
 ---
 
 # Meta-Llama-3.1-405B-Instruct-GGUF
 
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6604e5b21eb292d6df393365/o7DiWuILyzaPLh4Ne1JKr.png)
+
 Low bit quantizations of Meta's Llama 3.1 405B Instruct model. Quantized from ollama q4_0 GGUF.
 
+Quantized with llama.cpp [b3449](https://github.com/ggerganov/llama.cpp/releases/tag/b3449)
+
 **Quants:**
 - Q2_K
 - (imatrix)