metadata
base_model: Pinkstack/PARM-V1-phi-4-4k-CoT-pytorch
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- gguf
- code
- phi3
- cot
- o1
- reasoning
license: mit
language:
- en
pipeline_tag: text-generation
This is our flagship model, with top-tier reasoning that rivals gemini-flash-exp-2.0-thinking and o1 mini; overall, its results are similar to both. We do not compare against QwQ because its much longer outputs waste tokens. The model uses the following prompt format (a modified phi-4 prompt):
{{ if .System }}<|system|>
{{ .System }}<|im_end|>
{{ end }}{{ if .Prompt }}<|user|>
{{ .Prompt }}<|im_end|>
{{ end }}<|assistant|>{{ .CoT }}<|CoT|>
{{ .Response }}<|FinalAnswer|><|im_end|>
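For use outside a runtime that applies the template automatically, the format above can be assembled by hand. The following is a minimal Python sketch, not an official client: the helper names `build_prompt` and `split_completion` are hypothetical, and the parsing assumes, based only on the template, that the reasoning text comes before the `<|CoT|>` marker and the final answer comes before `<|FinalAnswer|>`.

```python
# Special tokens copied from the prompt template above.
IM_END = "<|im_end|>"
COT_MARK = "<|CoT|>"
FINAL_MARK = "<|FinalAnswer|>"


def build_prompt(user_message, system_message=None):
    """Assemble the text sent to the model, ending at the assistant tag."""
    parts = []
    if system_message:
        parts.append(f"<|system|>\n{system_message}{IM_END}\n")
    if user_message:
        parts.append(f"<|user|>\n{user_message}{IM_END}\n")
    parts.append("<|assistant|>")
    return "".join(parts)


def split_completion(completion):
    """Split raw model output into (reasoning, answer).

    Assumption: reasoning precedes the <|CoT|> marker and the answer
    precedes <|FinalAnswer|>, as the template layout suggests.
    """
    text = completion.replace(IM_END, "")
    reasoning, marker, rest = text.partition(COT_MARK)
    if not marker:
        # No reasoning marker found: treat the whole output as the answer.
        reasoning, rest = "", text
    answer = rest.replace(FINAL_MARK, "")
    return reasoning.strip(), answer.strip()


if __name__ == "__main__":
    print(build_prompt("What is 6 * 7?", "Answer briefly."))
    print(split_completion("6 times 7 is 42.<|CoT|>\n42<|FinalAnswer|><|im_end|>"))
```

The string returned by `build_prompt` is what a raw completion endpoint would receive; `split_completion` is only useful when the runtime returns special tokens in the decoded text, which depends on the decoding settings.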
Uploaded model
- Developed by: Pinkstack
- License: MIT
- Fine-tuned from model: Pinkstack/PARM-V1-phi-4-4k-CoT-pytorch
This Phi-4 model was trained with Unsloth and Hugging Face's TRL library.