Collections

Discover the best community collections!

Collections including paper arxiv:1909.08593
Papers - Reward Model
Collection by Apr 19
Papers - OpenAI
Collection by Jun 12
Papers - Fine-tuning - DPO
Refer to additional papers: https://link.springer.com/article/10.1007/s10994-014-5458-8 and https://link.springer.com/article/10.1007/BF00992696