Statuo
/

Deepseeker-Kunou-Qwen2.5-14b-EXL2-8bpw

Text Generation

text-generation-inference

Inference Endpoints

8-bit precision

Model card Files Files and versions Community

Statuo commited on 3 days ago

Commit

46e973e

·

verified ·

1 Parent(s): b3d71a6

Update README.md

Files changed (1) hide show

README.md +20 -2

README.md CHANGED Viewed

@@ -6,8 +6,26 @@ library_name: transformers
 tags:
 - mergekit
 - merge
 ---
 # merge
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
@@ -41,4 +59,4 @@ models:
 merge_method: linear
 dtype: float16
-```

 tags:
 - mergekit
 - merge
+license: apache-2.0
 ---
+![DeepseekerKunou](https://files.catbox.moe/n5ejwr.png)
+Been a while since I've seen you with a merge of my own, eh? This is released under the Qwen License which is part of the files in this quant and the main model.
+<br>
+Anyway, with the release of Deepseek R Distill, I gave it a poke and found what I expected. Not great for creative writing tasks but seems otherwise intelligent. So I decided to take a stab at another merge much like I did with LemonKunoichiWizard. What you're seeing here is the result of - give or take - four separate merges. Out of the merges, I feel this one is the best out of the four. That all being said, I think it increased the base intelligence of Kunou marginally which was nice to see. That being said, it's a 14b and I could be chugging a placebo so take that with a grain of salt. Thanks again to Sao10k for the finetune and the Deepseek team for releasing it under an open license. Hopefully you all enjoy.
+<br>
+I tested using SLERP as well, but the SLERP version was noticeably stupider. Only other thing of note is that it still has that Qwen tendency to occasionally just spam output the EoS tag sometimes.
+<br>
+<br>
+[This is the EXL2 8bpw version of this model. For the original model, go here](https://huggingface.co/Statuo/Deepseeker-Kunou)
+<br>
+[For the 6bpw version, go here](https://huggingface.co/Statuo/Deepseeker-Kunou-EXL2-6bpw)
+<br>
+[For the 4bpw version, go here](https://huggingface.co/Statuo/Deepseeker-Kunou-EXL2-4bpw)
+<br>
 # merge
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 merge_method: linear
 dtype: float16
+```