Pinkstack
/

PARM-V2-QwQ-Qwen-2.5-o1-3B-GGUF

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Pinkstack commited on 28 days ago

Commit

e057edf

·

verified ·

1 Parent(s): e0e26fe

Update README.md

Files changed (1) hide show

README.md +32 -3

README.md CHANGED Viewed

@@ -6,17 +6,46 @@ tags:
 - unsloth
 - qwen2
 - gguf
 license: apache-2.0
 language:
 - en
 ---
 # Uploaded  model
 - **Developed by:** Pinkstack
 - **License:** apache-2.0
 - **Finetuned from model :** Pinkstack/PARM-V1.5-QwQ-Qwen-2.5-o1-3B-VLLM
-This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 - unsloth
 - qwen2
 - gguf
+- Reasoning
+- o1
+- qwq
 license: apache-2.0
 language:
 - en
+pipeline_tag: text-generation
 ---
+[Pinkstack.png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/2xMulpuSlZ3C1vpGgsAYi.png)
+ 🧀 Which quant is right for you?
+- ***Q4:*** This model should be used on edge devices like high end phones or laptops due to its very compact size, quality is okay but fully usable.
+- ***Q8:*** This model should be used on most high end modern devices like rtx 3080, Responses are very high quality, but its noticeably slower than q4
+This Parm v2 is based on Qwen 2.5 3B which has gotten many extra reasoning training parameters so it would have similar outputs to qwen QwQ / O.1 mini (only much, smaller.), We trained using [this](https://huggingface.co/datasets/gghfez/QwQ-LongCoT-130K-cleaned) dataset, opus dataset and sonnet 3.5 dataset from huggingface.
+This is a pretty heavy to run model if you want on device ai's for phones I'd recommend using the 0.5B version of this model (coming soon)
+To use this model, you must use a service which supports the GGUF file format.
+Additionaly, this is the Prompt Template: it uses the qwen2 template.
+```
+{{ if .System }}<|system|>
+{{ .System }}<|end|>
+{{ end }}{{ if .Prompt }}<|user|>
+{{ .Prompt }}<|end|>
+{{ end }}<|assistant|>
+{{ .Response }}<|end|>
+```
+Or if you are using an anti prompt: <|end|><|assistant|>
+Highly recommended to use with a system prompt.
 # Uploaded  model
 - **Developed by:** Pinkstack
 - **License:** apache-2.0
 - **Finetuned from model :** Pinkstack/PARM-V1.5-QwQ-Qwen-2.5-o1-3B-VLLM
+This ai model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.