DeepSeek-R1-Distill-Qwen-7B-GRPO_Math / generation_config.json

Commit History

Model save
6c50d3f
verified

Dongwei commited on