MaziyarPanahi commited on
Commit
33cff3d
1 Parent(s): 4a6e8ed

add example how to download splits for 1 quant

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md CHANGED
@@ -35,6 +35,14 @@ quantized_by: MaziyarPanahi
35
  ## Description
36
  [MaziyarPanahi/WizardLM-2-8x22B-GGUF](https://huggingface.co/MaziyarPanahi/WizardLM-2-8x22B-GGUF) contains GGUF format model files for [microsoft/WizardLM-2-8x22B](https://huggingface.co/microsoft/WizardLM-2-8x22B).
37
 
 
 
 
 
 
 
 
 
38
  ## Load sharded model
39
 
40
  `llama_load_model_from_file` will detect the number of files and will load additional tensors from the rest of files.
 
35
  ## Description
36
  [MaziyarPanahi/WizardLM-2-8x22B-GGUF](https://huggingface.co/MaziyarPanahi/WizardLM-2-8x22B-GGUF) contains GGUF format model files for [microsoft/WizardLM-2-8x22B](https://huggingface.co/microsoft/WizardLM-2-8x22B).
37
 
38
+ ## How to download
39
+ You can download only the quants you need instead of cloning the entire repository as follows:
40
+
41
+
42
+ ```
43
+ huggingface-cli download MaziyarPanahi/WizardLM-2-8x22B-GGUF --local-dir . --include '*Q2_K*gguf'
44
+ ```
45
+
46
  ## Load sharded model
47
 
48
  `llama_load_model_from_file` will detect the number of files and will load additional tensors from the rest of files.