EryriLabs committed on
Commit 154c064 · verified · 1 Parent(s): 4ca0dd1

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,10 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+deepseek-r1-distill-qwen-yara-thinker-7b.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+deepseek-r1-distill-qwen-yara-thinker-7b.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+deepseek-r1-distill-qwen-yara-thinker-7b.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+deepseek-r1-distill-qwen-yara-thinker-7b.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+deepseek-r1-distill-qwen-yara-thinker-7b.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+deepseek-r1-distill-qwen-yara-thinker-7b.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
+deepseek-r1-distill-qwen-yara-thinker-7b.bf16.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,40 @@
+---
+base_model:
+- deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
+- vtriple/Qwen-2.5-7B-Threatflux
+library_name: transformers
+tags:
+- mergekit
+- merge
+- autoquant
+- gguf
+---
+# DeepSeek-R1-Distill-Qwen-YARA-Thinker-7B
+
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+## Merge Details
+### Merge Method
+
+This model was merged using the SLERP merge method.
+
+### Models Merged
+
+The following models were included in the merge:
+* [deepseek-ai/DeepSeek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B)
+* [vtriple/Qwen-2.5-7B-Threatflux](https://huggingface.co/vtriple/Qwen-2.5-7B-Threatflux)
+
+### Configuration
+
+The following YAML configuration was used to produce this model:
+
+```yaml
+models:
+  - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
+  - model: vtriple/Qwen-2.5-7B-Threatflux
+merge_method: slerp
+base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
+dtype: bfloat16
+parameters:
+  t: [0, 0.5, 0.25]
+```
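The configuration above names SLERP as the merge method and gives `t` as a list of three values. As a rough illustration only, the sketch below shows spherical linear interpolation between two flattened weight vectors, plus one plausible way a list-valued `t` could be stretched across layers. The helper names `slerp` and `gradient_value`, the five-layer example, and the exact layer-to-position mapping are assumptions made for illustration; this is not mergekit's implementation, and it assumes `t = 0` selects the base model.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between two flat vectors v0 (t=0) and v1 (t=1)."""
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    # Plain linear interpolation, used when the angle is undefined or tiny.
    lerp = [(1.0 - t) * a + t * b for a, b in zip(v0, v1)]
    if norm0 < eps or norm1 < eps:
        return lerp
    # Cosine of the angle between the two vectors, clamped for acos.
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)
    if abs(sin_theta) < eps:
        return lerp  # vectors are (anti)parallel
    w0 = math.sin((1.0 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def gradient_value(anchors, layer_idx, num_layers):
    """Piecewise-linear value of a list-valued parameter at a given layer.

    Assumption: the anchors are spread evenly from the first to the last layer.
    """
    if num_layers == 1 or len(anchors) == 1:
        return float(anchors[0])
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1.0 - frac) * anchors[lo] + frac * anchors[hi]


if __name__ == "__main__":
    # Hypothetical 5-layer model, using the t list from the configuration.
    for layer in range(5):
        print(layer, gradient_value([0, 0.5, 0.25], layer, 5))
```

Under these assumptions, the first layers stay close to the base model, the middle layers blend the two models roughly evenly, and the last layers lean back toward the base model.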
deepseek-r1-distill-qwen-yara-thinker-7b.Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:288d4eda4db8bd088605d52c0dcd521fc5e062b95bd0bf53ef173dcd173af144
+size 3015940416
deepseek-r1-distill-qwen-yara-thinker-7b.Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4f1e2c8ad567a26298cf095499c82a8d8dae1493d8af9c6be986cf47249987f7
+size 3808391488
deepseek-r1-distill-qwen-yara-thinker-7b.Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:45ca7778b46db724e1482fa8f488ba155b68f990b279a6394db7b0f22bd1bcf5
+size 4683073856
deepseek-r1-distill-qwen-yara-thinker-7b.Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:666d5a97b00a31a546dbcc6bcd4d98f778f9ca3ba7b042a6c67afdd8b990af79
+size 5444831552
deepseek-r1-distill-qwen-yara-thinker-7b.Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5d5ae536651607df9de47bceaaa59659a860264c434fec110cdb4f0e6f19a56a
+size 6254199104
deepseek-r1-distill-qwen-yara-thinker-7b.Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e1936c7f36174e4f00b7387148e1dc79010f3ca8a96cb762e81ded77fb8b9721
+size 8098525504
deepseek-r1-distill-qwen-yara-thinker-7b.bf16.gguf ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e852a22d97f936f423a688ca0e854a3e84b29e8b580226ccf7f50ac6e26bbd80
+size 15237853504
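Each `.gguf` entry in this commit is a Git LFS pointer: three lines giving the spec version, a SHA-256 object ID, and the size in bytes. A downloaded file can be checked against its pointer with a few lines of Python. The sketch below is a minimal illustration; the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of Git LFS or `huggingface_hub`.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,        # e.g. "sha256"
        "digest": digest,    # hex digest of the real file
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and hash the pointer records."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer_text = (
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:288d4eda4db8bd088605d52c0dcd521fc5e062b95bd0bf53ef173dcd173af144\n"
        "size 3015940416\n"
    )
    pointer = parse_lfs_pointer(pointer_text)
    print(pointer["size"])
    # With the Q2_K file downloaded locally, one would call:
    # matches_pointer("deepseek-r1-distill-qwen-yara-thinker-7b.Q2_K.gguf", pointer)
```

Comparing the size first is a cheap way to catch truncated downloads; the hash comparison then confirms the content itself.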