R136a1 commited on
Commit
d7e6d06
1 Parent(s): 4cac686

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +32 -0
README.md ADDED
@@ -0,0 +1,32 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ language:
4
+ - en
5
+ ---
6
+
7
+ [EXL2](https://github.com/turboderp/exllamav2/tree/master#exllamav2) Quantization of [Undi95's's MM-ReMM-L2-20B](https://huggingface.co/Undi95/MM-ReMM-L2-20B).
8
+
9
+
10
+ ## Model details
11
+
12
+ Quantized at 3.18bpw with hb 6, This one can actually go full 4K context on 16GB VRAM, will redo the other 20b models later
13
+
14
+ Perplexity:
15
+
16
+ Base = 6.9504
17
+
18
+ 3.23 h8 = 7.0138
19
+
20
+ Dataset = [wikitext](https://huggingface.co/datasets/wikitext/resolve/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet)
21
+
22
+ ## Prompt Format
23
+
24
+ ```
25
+ Below is an instruction that describes a task. Write a response that appropriately completes the request.
26
+
27
+ ### Instruction:
28
+ {prompt}
29
+
30
+ ### Response:
31
+
32
+ ```