TheBloke/stable-vicuna-13B-GPTQ


TheBloke committed on Apr 29, 2023

Commit 5e06a28 · 1 Parent(s): 831916c

Update README.md

Files changed (1)

README.md +9 -7


@@ -35,17 +35,19 @@ This model works best with the following prompt template:
 ## How to easily download and use this model in text-generation-webui
-Load text-generation-webui as you normally do.
 1. Click the **Model tab**.
-2. Under **Download custom model or LoRA**, enter this repo name: `TheBloke/stable-vicuna-13B-GPTQ`.
 3. Click **Download**.
 4. Wait until it says it's finished downloading.
-5. As this is a GPTQ model, fill in the `GPTQ parameters` on the right: `Bits = 4`, `Groupsize = 128`, `model_type = Llama`
-6. Now click the **Refresh** icon next to **Model** in the top left.
-7. In the **Model drop-down**: choose this model: `stable-vicuna-13B-GPTQ`.
-8. Click **Reload the Model** in the top right.
-9. Once it says it's loaded, click the **Text Generation tab** and enter a prompt!
 ## Provided files

 ## How to easily download and use this model in text-generation-webui
+Open the text-generation-webui UI as normal.
 1. Click the **Model tab**.
+2. Under **Download custom model or LoRA**, enter `TheBloke/stable-vicuna-13B-GPTQ`.
 3. Click **Download**.
 4. Wait until it says it's finished downloading.
+5. Click the **Refresh** icon next to **Model** in the top left.
+6. In the **Model drop-down**: choose the model you just downloaded, `stable-vicuna-13B-GPTQ`.
+7. If you see an error in the bottom right, ignore it - it's temporary.
+8. Fill out the `GPTQ parameters` on the right: `Bits = 4`, `Groupsize = 128`, `model_type = Llama`.
+9. Click **Save settings for this model** in the top right.
+10. Click **Reload the Model** in the top right.
+11. Once it says it's loaded, click the **Text Generation tab** and enter a prompt!
 ## Provided files
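
For readers who prefer scripting the download step rather than using the UI, the sketch below is one possible approach; it is not part of this commit or of the README. It assumes the `huggingface_hub` package is installed and that a local `models/` folder is where the files should go (a hypothetical location). The dictionary keys are illustrative names for the values in step 8, not a text-generation-webui API.

```python
from pathlib import Path

# Repo name taken from step 2 of the instructions above.
REPO_ID = "TheBloke/stable-vicuna-13B-GPTQ"

# GPTQ values from step 8. The key names are illustrative only.
GPTQ_PARAMS = {"Bits": 4, "Groupsize": 128, "model_type": "Llama"}


def download_model(models_dir: str = "models") -> Path:
    """Download the repo into `models_dir` (a hypothetical local folder)."""
    # Imported lazily so the rest of the sketch runs without the package.
    from huggingface_hub import snapshot_download

    target = Path(models_dir) / REPO_ID.split("/")[-1]
    snapshot_download(repo_id=REPO_ID, local_dir=str(target))
    return target


def describe_settings(params: dict) -> str:
    """Format the GPTQ values the way the instructions list them."""
    return ", ".join(f"{key} = {value}" for key, value in params.items())


if __name__ == "__main__":
    print(describe_settings(GPTQ_PARAMS))
    # Uncomment to fetch the files (needs network access and disk space):
    # print(download_model())
```

The GPTQ values still have to be entered and saved in the UI as described in steps 8 and 9; this sketch only covers fetching the files and keeping the expected values in one place.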