Update README.md
Browse files
README.md
CHANGED
@@ -6,17 +6,46 @@ tags:
|
|
6 |
- unsloth
|
7 |
- qwen2
|
8 |
- gguf
|
|
|
|
|
|
|
9 |
license: apache-2.0
|
10 |
language:
|
11 |
- en
|
|
|
12 |
---
|
13 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
14 |
# Uploaded model
|
15 |
|
16 |
- **Developed by:** Pinkstack
|
17 |
- **License:** apache-2.0
|
18 |
- **Finetuned from model :** Pinkstack/PARM-V1.5-QwQ-Qwen-2.5-o1-3B-VLLM
|
19 |
|
20 |
-
This
|
21 |
-
|
22 |
-
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|
|
|
6 |
- unsloth
|
7 |
- qwen2
|
8 |
- gguf
|
9 |
+
- Reasoning
|
10 |
+
- o1
|
11 |
+
- qwq
|
12 |
license: apache-2.0
|
13 |
language:
|
14 |
- en
|
15 |
+
pipeline_tag: text-generation
|
16 |
---
|
17 |
|
18 |
+
[Pinkstack.png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/2xMulpuSlZ3C1vpGgsAYi.png)
|
19 |
+
|
20 |
+
🧀 Which quant is right for you?
|
21 |
+
|
22 |
+
- ***Q4:*** This model should be used on edge devices like high end phones or laptops due to its very compact size, quality is okay but fully usable.
|
23 |
+
- ***Q8:*** This model should be used on most high end modern devices like rtx 3080, Responses are very high quality, but its noticeably slower than q4
|
24 |
+
|
25 |
+
This Parm v2 is based on Qwen 2.5 3B which has gotten many extra reasoning training parameters so it would have similar outputs to qwen QwQ / O.1 mini (only much, smaller.), We trained using [this](https://huggingface.co/datasets/gghfez/QwQ-LongCoT-130K-cleaned) dataset, opus dataset and sonnet 3.5 dataset from huggingface.
|
26 |
+
This is a pretty heavy to run model if you want on device ai's for phones I'd recommend using the 0.5B version of this model (coming soon)
|
27 |
+
|
28 |
+
|
29 |
+
To use this model, you must use a service which supports the GGUF file format.
|
30 |
+
Additionaly, this is the Prompt Template: it uses the qwen2 template.
|
31 |
+
```
|
32 |
+
{{ if .System }}<|system|>
|
33 |
+
{{ .System }}<|end|>
|
34 |
+
{{ end }}{{ if .Prompt }}<|user|>
|
35 |
+
{{ .Prompt }}<|end|>
|
36 |
+
{{ end }}<|assistant|>
|
37 |
+
{{ .Response }}<|end|>
|
38 |
+
```
|
39 |
+
|
40 |
+
Or if you are using an anti prompt: <|end|><|assistant|>
|
41 |
+
|
42 |
+
Highly recommended to use with a system prompt.
|
43 |
+
|
44 |
+
|
45 |
# Uploaded model
|
46 |
|
47 |
- **Developed by:** Pinkstack
|
48 |
- **License:** apache-2.0
|
49 |
- **Finetuned from model :** Pinkstack/PARM-V1.5-QwQ-Qwen-2.5-o1-3B-VLLM
|
50 |
|
51 |
+
This ai model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
|
|
|
|