Update README.md
Browse files
README.md
CHANGED
@@ -5,7 +5,7 @@ inference: false
|
|
5 |
|
6 |
# OpenAssistant LLaMA 30B SFT 7 GGML
|
7 |
|
8 |
-
This
|
9 |
|
10 |
It is the result of merging the XORs from the above repo with the original Llama 30B weights, and then quantising to 4bit and 5bit GGML for CPU inference using [llama.cpp](https://github.com/ggerganov/llama.cpp).
|
11 |
|
|
|
5 |
|
6 |
# OpenAssistant LLaMA 30B SFT 7 GGML
|
7 |
|
8 |
+
This is a repo of GGML format models for [OpenAssistant's LLaMA 30B SFT 7](https://huggingface.co/OpenAssistant/oasst-sft-7-llama-30b-xor).
|
9 |
|
10 |
It is the result of merging the XORs from the above repo with the original Llama 30B weights, and then quantising to 4bit and 5bit GGML for CPU inference using [llama.cpp](https://github.com/ggerganov/llama.cpp).
|
11 |
|