Highly advanced based model for training:

  • Sequence Length: 131072
  • Parm 2 ultra: trained for on OpenO1 chats, sonnet 3.5, qwq messages.
Downloads last month
33
Safetensors
Model size
3.09B params
Tensor type
FP16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for Pinkstack/PARM-v2-ULTRA-o1-3B-vLLM

Base model

Qwen/Qwen2.5-3B
Finetuned
(1)
this model

Collection including Pinkstack/PARM-v2-ULTRA-o1-3B-vLLM