Update README.md
Browse files
README.md
CHANGED
@@ -22,9 +22,7 @@ It is the result of quantising to 4bit and 5bit GGML for CPU inference using [ll
|
|
22 |
|
23 |
llama.cpp recently made a breaking change to its quantisation methods.
|
24 |
|
25 |
-
I have
|
26 |
-
|
27 |
-
The previous files, which will still work in older versions of llama.cpp, can be found in branch `previous_llama`.
|
28 |
|
29 |
## Provided files
|
30 |
| Name | Quant method | Bits | Size | RAM required | Use case |
|
|
|
22 |
|
23 |
llama.cpp recently made a breaking change to its quantisation methods.
|
24 |
|
25 |
+
I have quantised the GGML files in this repo with the latest version. Therefore you will require llama.cpp compiled on May 12th or later (commit `b9fd7ee` or later) to use them.
|
|
|
|
|
26 |
|
27 |
## Provided files
|
28 |
| Name | Quant method | Bits | Size | RAM required | Use case |
|