AzureBlack committed • fca2891
Parent(s): 501240a
Update README.md

README.md CHANGED
@@ -13,9 +13,13 @@ Requires ExllamaV2, which is being developed by turboderp https://github.com/tur
 
 Main branch is 4.6bpw 8h (req ??gb)
 
+2.5b8h branch is 2.55bpw 8h (req 24gb and the 8-bit cache setting) - "Add BOS token" must be unchecked at this weight or the output is nonsense.
+
 5.0b8h branch is 5.0bpw 8h (req ??gb)
 
-
+6b8h branch is coming soon
+
+
 
 --------
 <center>[<a href="https://huggingface.co/Undi95/Dawn-v2-70B">fp16</a> - <a href="https://huggingface.co/Undi95/Dawn-v2-70B-GGUF">gguf</a> - exl2 : <a href="https://huggingface.co/Undi95/Dawn-v2-70B-2.4bpw-h6-exl2">2.40bpw</a> - <a href="https://huggingface.co/Undi95/Dawn-v2-70B-2.55bpw-h6-exl2">2.55bpw</a>]</center>
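The "req ??gb" figures for the other branches can be ballparked from the bits-per-weight: quantized weight size is roughly parameter count × bpw ÷ 8. A minimal sketch, assuming the 70B parameter count implied by the model name; this counts weights only and ignores the KV cache and runtime overhead, so actual requirements are somewhat higher (the stated 24gb for the 2.5b8h branch includes the 8-bit cache on top of the weights).

```python
def weight_gb(params_billion: float, bpw: float) -> float:
    """Approximate size of the quantized weights in GB:
    (parameters * bits-per-weight) / 8 bits-per-byte."""
    return params_billion * 1e9 * bpw / 8 / 1e9

# Branches and bpw values are from the card above; 70B is an assumption.
for branch, bpw in [("main", 4.6), ("2.5b8h", 2.55), ("5.0b8h", 5.0)]:
    print(f"{branch}: ~{weight_gb(70, bpw):.1f} GB of weights at {bpw} bpw")
```

This is a lower bound only; budget extra VRAM for the cache and activations.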