underscore2/qwen-2.5-3b-grpo-v3

text-generation-inference

Inference Endpoints

1 contributor

History: 4 commits

Trained with Unsloth

ec4f8c6 verified 8 days ago