metadata

base_model: Pinkstack/PARM-V1-phi-4-4k-CoT-pytorch
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - llama
  - gguf
  - code
  - phi3
  - cot
  - o1
  - reasoning
license: mit
language:
  - en
pipeline_tag: text-generation

This is our flagship model, with top-tier reasoning that rivals gemini-flash-exp-2.0-thinking and o1 mini. Its results are broadly similar to both. We do not compare against QwQ, because its much longer outputs waste tokens. The model uses the following prompt template (a modified Phi-4 prompt):

{{ if .System }}<|system|>
{{ .System }}<|im_end|>
{{ end }}{{ if .Prompt }}<|user|>
{{ .Prompt }}<|im_end|>
{{ end }}<|assistant|>{{ .CoT }}<|CoT|>
{{ .Response }}<|FinalAnswer|><|im_end|>
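
For use outside a runtime that applies the template automatically, the layout above can be reproduced by hand. The following Python sketch is a minimal illustration, not an official client: the helper names `build_prompt` and `split_output` are hypothetical, and it assumes the model emits the `<|CoT|>`, `<|FinalAnswer|>`, and `<|im_end|>` markers literally, in the order the template shows.

```python
from typing import Optional, Tuple


def build_prompt(prompt: str, system: Optional[str] = None) -> str:
    """Render the template up to the assistant turn (hypothetical helper)."""
    parts = []
    if system:
        # Optional system block, mirroring {{ if .System }} ... {{ end }}
        parts.append(f"<|system|>\n{system}<|im_end|>\n")
    if prompt:
        # User block, mirroring {{ if .Prompt }} ... {{ end }}
        parts.append(f"<|user|>\n{prompt}<|im_end|>\n")
    # The model continues from here: reasoning, <|CoT|>, answer, <|FinalAnswer|>
    parts.append("<|assistant|>")
    return "".join(parts)


def split_output(generated: str) -> Tuple[str, str]:
    """Split generated text into (reasoning, answer) (hypothetical helper).

    Assumes the reasoning precedes <|CoT|> and the answer sits between
    <|CoT|> and <|FinalAnswer|>, as laid out in the template.
    """
    # Drop everything from the end-of-turn marker onward
    text = generated.split("<|im_end|>", 1)[0]
    reasoning, marker, rest = text.partition("<|CoT|>")
    if not marker:
        # No reasoning marker found: treat the whole text as the answer
        reasoning, rest = "", text
    answer = rest.split("<|FinalAnswer|>", 1)[0]
    return reasoning.strip(), answer.strip()
```

Pass the string returned by `build_prompt` to the generation backend as a raw prompt, then run `split_output` on the completion to separate the reasoning from the final answer.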

Uploaded model

Developed by: Pinkstack
License: MIT
Finetuned from model: Pinkstack/PARM-V1-phi-4-4k-CoT-pytorch

This Phi-4 model was trained with Unsloth and Hugging Face's TRL library.