arxiv:2501.10799
Di Jin
jindi
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary
Feedback
Organizations
Papers
1
models
None public yet
datasets
None public yet