jeremy-costello/vicuna-13b-v1.1-4bit-128g

Text Generation

jeremy-costello committed on Apr 15, 2023

Commit 847edef • 1 Parent(s): 6826888

Update README.md

Files changed (1)

README.md +1 -1

@@ -4,7 +4,7 @@ inference: false
 ---
 4-bit quantization of the vicuna-13b-v1.1 model.
-The delta was added to the original LLaMa weights using FastChat.
+The delta was added to the original LLaMa weights using FastChat. \
 Quantization and inference with GPTQ-For-LLaMa (commit 58c8ab4).
 Quantization args: $MODEL_DIRECTORY, c4, wbits 4, true-sequential, act-order, groupsize 128. \
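
For readers unfamiliar with the `wbits 4` and `groupsize 128` arguments in the README, the sketch below illustrates roughly what they control: each run of 128 weights gets its own scale and offset, and every weight is stored as a 4-bit integer code. This is a minimal illustration using plain round-to-nearest, not the GPTQ algorithm that GPTQ-For-LLaMa actually applies (GPTQ additionally compensates for rounding error using calibration data such as c4). The function names `quantize_group`, `dequantize_group`, and `quantize_row` are hypothetical and do not come from that repository.

```python
def quantize_group(values, wbits=4):
    """Round-to-nearest quantization of one group of weights (illustrative only)."""
    levels = (1 << wbits) - 1                 # 15 distinct steps for 4 bits
    lo, hi = min(values), max(values)
    scale = (hi - lo) / levels if hi > lo else 1.0
    # Map each weight to an integer code in [0, levels].
    codes = [min(levels, max(0, round((v - lo) / scale))) for v in values]
    return codes, scale, lo


def dequantize_group(codes, scale, lo):
    """Recover approximate weights from integer codes."""
    return [c * scale + lo for c in codes]


def quantize_row(row, wbits=4, groupsize=128):
    """Split a row of weights into groups and quantize each group separately."""
    return [
        quantize_group(row[start:start + groupsize], wbits)
        for start in range(0, len(row), groupsize)
    ]


# Example: a row of 256 weights becomes 2 groups, each with its own scale/offset.
row = [i / 255 for i in range(256)]
groups = quantize_row(row, wbits=4, groupsize=128)
restored = [w for codes, scale, lo in groups for w in dequantize_group(codes, scale, lo)]
```

A smaller group size means more scale/offset pairs to store but a tighter fit per group, which is the trade-off the `128g` suffix in the model name refers to.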