Update README.md
README.md (changed)
These files are pytorch format fp16 model files for [Mikael110's Llama2 70b Guanaco QLoRA](https://huggingface.co/Mikael110/llama-2-70b-guanaco-qlora).

It is the result of merging and/or converting the source repository to float16.

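As a rough illustration of that merge-and-convert step, the sketch below folds a QLoRA adapter into its base model and saves a plain fp16 checkpoint. It assumes the `transformers` and `peft` libraries; the repo IDs in the usage line are the ones linked on this page, but the exact procedure used for this repository may differ.

```python
def merge_qlora_to_fp16(base_model_id: str, adapter_id: str, out_dir: str) -> None:
    """Merge a QLoRA adapter into its base model and save the result in float16.

    Illustrative sketch only, not the exact commands used for this repo.
    Requires the `transformers` and `peft` libraries (imported lazily here).
    """
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM

    # Load the base model with its weights cast to float16.
    base = AutoModelForCausalLM.from_pretrained(base_model_id, torch_dtype=torch.float16)
    # Attach the QLoRA adapter, then fold its low-rank updates into the base weights.
    merged = PeftModel.from_pretrained(base, adapter_id).merge_and_unload()
    # Write out a plain fp16 checkpoint suitable for further conversion (GPTQ, GGML, ...).
    merged.save_pretrained(out_dir)
```

For this model that would look like `merge_qlora_to_fp16("meta-llama/Llama-2-70b-hf", "Mikael110/llama-2-70b-guanaco-qlora", "llama-2-70b-guanaco-fp16")`, run on a machine with enough RAM/VRAM to hold the merged 70B weights.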
## Repositories available

* [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/llama-2-70b-Guanaco-QLoRA-GPTQ)
* [GGML experimental 4, 5, 6 and 8-bit models for CPU only inference](https://huggingface.co/TheBloke/llama-2-70b-Guanaco-QLoRA-GGML)
* [Merged fp16 model in pytorch model format for GPU inference and further conversions](https://huggingface.co/TheBloke/llama-2-70b-Guanaco-QLoRA-fp16)
* [Original QLoRA model](https://huggingface.co/Mikael110/llama-2-70b-guanaco-qlora)

## Prompt template: Guanaco

```
### Human: {prompt}