Text Generation
Transformers
Safetensors
llama
Not-For-All-Audiences
nsfw
text-generation-inference
Inference Endpoints
AzureBlack committed · Commit 70386be · 1 Parent(s): fca2891 · Update README.md
README.md CHANGED
@@ -7,17 +7,17 @@ tags:
 
 ExllamaV2 version of the model created by [Undi](https://huggingface.co/Undi95)!
 
-Original Model https://huggingface.co/Undi95/Dawn-v2-70B
+Original Model https://huggingface.co/Undi95/Dawn-v2-70B
 
 Requires ExllamaV2, which is being developed by turboderp https://github.com/turboderp/exllamav2 under an MIT license.
 
 Main branch is 4.6bpw 8h (req ??gb)
 
-2.5b8h branch is 2.
+2.5b8h branch is 2.5bpw 8h (req 24gb and the 8b cache setting) - Add BOS token must be unchecked at this weight or output is nonsense. New quant method applied 12/17/2023
 
 5.0b8h branch is 5.0bpw 8h (req ??gb)
 
-6b8h branch is
+6b8h branch is 6.0bpw 8h requires between 60-72gb
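Several branch descriptions still list their memory requirement as "??gb". As a rough sanity check, the weights-only footprint of a bits-per-weight (bpw) quant can be estimated from the parameter count. This is a back-of-envelope sketch only, assuming ~70e9 parameters from the "70B" in the model name; the KV cache, activations, and runtime overhead add several GB on top, which is consistent with the 6.0bpw branch listing 60-72gb rather than the ~52 GB of weights alone.

```python
# Rough weights-only VRAM estimate for a bpw (bits-per-weight) quant.
# Assumption: ~70e9 parameters (from the "70B" model name). KV cache,
# activations, and framework overhead are NOT included and add several GB.

def est_weight_gb(n_params: float, bpw: float) -> float:
    """Approximate size in GB of the quantized weights alone."""
    return n_params * bpw / 8 / 1e9  # bits -> bytes -> GB

if __name__ == "__main__":
    for bpw in (2.5, 4.6, 5.0, 6.0):
        print(f"{bpw}bpw: ~{est_weight_gb(70e9, bpw):.1f} GB (weights only)")
```

For example, the 2.5bpw branch works out to roughly 22 GB of weights, which lines up with the stated 24gb requirement once cache overhead is added.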