metadata
base_model: Pinkstack/PARM-V1-phi-4-4k-CoT-pytorch
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - llama
  - gguf
  - code
  - phi3
  - cot
  - o1
  - reasoning
license: mit
language:
  - en
pipeline_tag: text-generation

This is our flagship model, with top-tier reasoning that rivals gemini-flash-exp-2.0-thinking and o1 mini. Its results are broadly similar to both. We do not compare against QwQ, since its much longer outputs waste tokens. The model uses the following prompt template (a modified Phi-4 prompt):

{{ if .System }}<|system|>
{{ .System }}<|im_end|>
{{ end }}{{ if .Prompt }}<|user|>
{{ .Prompt }}<|im_end|>
{{ end }}<|assistant|>{{ .CoT }}<|CoT|>
{{ .Response }}<|FinalAnswer|><|im_end|>
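
As a rough illustration of how the template above might be applied by hand, here is a minimal Python sketch. It assumes the markers behave as written in the template, with `<|CoT|>` closing the reasoning and `<|FinalAnswer|>` closing the response. The function names `build_prompt` and `split_completion` are hypothetical helpers, not part of any library or of this model's tooling.

```python
from typing import Optional, Tuple


def build_prompt(user: str, system: Optional[str] = None) -> str:
    """Render the prompt up to the assistant turn, mirroring the template."""
    parts = []
    if system:
        parts.append(f"<|system|>\n{system}<|im_end|>\n")
    if user:
        parts.append(f"<|user|>\n{user}<|im_end|>\n")
    # Generation is expected to continue from here with the reasoning text.
    parts.append("<|assistant|>")
    return "".join(parts)


def split_completion(text: str) -> Tuple[str, str]:
    """Split raw generated text into (reasoning, final answer).

    Assumption: the reasoning precedes <|CoT|> and the answer precedes
    <|FinalAnswer|>, as the template order suggests.
    """
    cot, sep, rest = text.partition("<|CoT|>")
    if not sep:
        # No reasoning marker found: treat everything as the answer.
        cot, rest = "", text
    answer = rest.split("<|FinalAnswer|>", 1)[0]
    return cot.strip(), answer.strip()
```

For example, `build_prompt("Hi", "Be brief")` yields the system and user turns followed by `<|assistant|>`, and `split_completion` can then be applied to whatever text the model generates after that point.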

Uploaded model

  • Developed by: Pinkstack
  • License: MIT
  • Finetuned from model: Pinkstack/PARM-V1-phi-4-4k-CoT-pytorch

This Phi-4 model was trained with Unsloth and Hugging Face's TRL library.