FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview



Add comparison with 70B distilled R1 model

#8 opened 1 day ago

Update model card

#7 opened 8 days ago

Temperature's effect on the performance of long chain reasoning models. Why was 0.7 used for the evals?

#6 opened 26 days ago

License of your model

#4 opened about 1 month ago

Evaluation

#3 opened about 1 month ago

Merge with 32b coder?

#2 opened about 1 month ago