rasdani
/

qwen2-math-7b-step-dpo

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

qwen2-math-7b-step-dpo / tokenizer.json

rasdani's picture

Training in progress, step 400

88a3813 verified 21 days ago

history contribute delete

No virus

7.03 MB

File too large to display, you can check the raw version instead.