phi-4-14B-grpo-limo-2e / model-00004-of-00006.safetensors

Commit History

Trained with Unsloth
d5f4c4e
verified

mrm8488 commited on