interrobang
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -6,4 +6,4 @@ A test quantization of OpenHermes-2.5-Mistral-7B by teknium using importance mat
|
|
6 |
|
7 |
Importance matrix was computed in roughly 20 minutes with a Ryzen 5 3550H and GTX 1650 with 8 layers offloaded.
|
8 |
|
9 |
-
Will be updated with perplexity testing later
|
|
|
6 |
|
7 |
Importance matrix was computed in roughly 20 minutes with a Ryzen 5 3550H and GTX 1650 with 8 layers offloaded.
|
8 |
|
9 |
+
Will be updated with perplexity testing later, probably? 😭 Haven't done proper tests quite yet, feels better than old quants when chatting in Ukrainian, hopefully I get around to actually benching it somehow
|