Update README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,7 @@
|
|
1 |
---
|
2 |
license: llama3.1
|
|
|
|
|
3 |
---
|
4 |
Llama 3.1 405B Quants and llama.cpp versions that is used for quantization
|
5 |
- IQ1_S: 86.8 GB - b3459
|
@@ -15,5 +17,4 @@ https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct
|
|
15 |
|
16 |
imatrix file https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/blob/main/405imatrix.dat
|
17 |
|
18 |
-
Lmk if you need bigger quants.
|
19 |
-
|
|
|
1 |
---
|
2 |
license: llama3.1
|
3 |
+
base_model:
|
4 |
+
- meta-llama/Llama-3.1-405B-Instruct
|
5 |
---
|
6 |
Llama 3.1 405B Quants and llama.cpp versions that is used for quantization
|
7 |
- IQ1_S: 86.8 GB - b3459
|
|
|
17 |
|
18 |
imatrix file https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/blob/main/405imatrix.dat
|
19 |
|
20 |
+
Lmk if you need bigger quants.
|
|