arxiv:2501.12599
ZhangDehao
ispoon
AI & ML interests
None yet
Recent Activity
Authored a paper 1 day ago: Kimi k1.5: Scaling Reinforcement Learning with LLMs
Authored a paper 8 months ago: OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
Organizations
None yet