etemiz/Llama-3.1-405B-Inst-GGUF

etemiz committed on Oct 7

Commit af207b4 • 1 Parent(s): a07d189

Update README.md

Files changed (1)

README.md +3 -2

@@ -1,5 +1,7 @@
 ---
 license: llama3.1
+base_model:
+- meta-llama/Llama-3.1-405B-Instruct
 ---
 Llama 3.1 405B Quants and llama.cpp versions that is used for quantization
 - IQ1_S: 86.8 GB - b3459
@@ -15,5 +17,4 @@ https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct
 imatrix file https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/blob/main/405imatrix.dat
-Lmk if you need bigger quants.
+Lmk if you need bigger quants.