Pinkstack commited on
Commit
e057edf
·
verified ·
1 Parent(s): e0e26fe

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +32 -3
README.md CHANGED
@@ -6,17 +6,46 @@ tags:
6
  - unsloth
7
  - qwen2
8
  - gguf
 
 
 
9
  license: apache-2.0
10
  language:
11
  - en
 
12
  ---
13
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
  # Uploaded model
15
 
16
  - **Developed by:** Pinkstack
17
  - **License:** apache-2.0
18
  - **Finetuned from model :** Pinkstack/PARM-V1.5-QwQ-Qwen-2.5-o1-3B-VLLM
19
 
20
- This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
21
-
22
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
6
  - unsloth
7
  - qwen2
8
  - gguf
9
+ - Reasoning
10
+ - o1
11
+ - qwq
12
  license: apache-2.0
13
  language:
14
  - en
15
+ pipeline_tag: text-generation
16
  ---
17
 
18
+ [Pinkstack.png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/2xMulpuSlZ3C1vpGgsAYi.png)
19
+
20
+ 🧀 Which quant is right for you?
21
+
22
+ - ***Q4:*** This model should be used on edge devices like high end phones or laptops due to its very compact size, quality is okay but fully usable.
23
+ - ***Q8:*** This model should be used on most high end modern devices like rtx 3080, Responses are very high quality, but its noticeably slower than q4
24
+
25
+ This Parm v2 is based on Qwen 2.5 3B which has gotten many extra reasoning training parameters so it would have similar outputs to qwen QwQ / O.1 mini (only much, smaller.), We trained using [this](https://huggingface.co/datasets/gghfez/QwQ-LongCoT-130K-cleaned) dataset, opus dataset and sonnet 3.5 dataset from huggingface.
26
+ This is a pretty heavy to run model if you want on device ai's for phones I'd recommend using the 0.5B version of this model (coming soon)
27
+
28
+
29
+ To use this model, you must use a service which supports the GGUF file format.
30
+ Additionaly, this is the Prompt Template: it uses the qwen2 template.
31
+ ```
32
+ {{ if .System }}<|system|>
33
+ {{ .System }}<|end|>
34
+ {{ end }}{{ if .Prompt }}<|user|>
35
+ {{ .Prompt }}<|end|>
36
+ {{ end }}<|assistant|>
37
+ {{ .Response }}<|end|>
38
+ ```
39
+
40
+ Or if you are using an anti prompt: <|end|><|assistant|>
41
+
42
+ Highly recommended to use with a system prompt.
43
+
44
+
45
  # Uploaded model
46
 
47
  - **Developed by:** Pinkstack
48
  - **License:** apache-2.0
49
  - **Finetuned from model :** Pinkstack/PARM-V1.5-QwQ-Qwen-2.5-o1-3B-VLLM
50
 
51
+ This ai model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.