ariG23498 
posted an update Jan 19
Tried my hand at simplifying the derivations of Direct Preference Optimization (DPO).

I cover how one can reformulate the RLHF objective into DPO. The idea of implicit reward modeling is chef's kiss.

Blog: https://huggingface.co/blog/ariG23498/rlhf-to-dpo
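For context, a sketch of where the reformulation lands: the standard DPO objective from the original paper (Rafailov et al., 2023). Here $\beta$ is the KL-penalty strength, $\pi_{\text{ref}}$ the frozen reference policy, and $(x, y_w, y_l)$ a prompt with preferred and dispreferred completions:

```latex
% Standard DPO objective (Rafailov et al., 2023).
\mathcal{L}_{\text{DPO}}(\pi_\theta; \pi_{\text{ref}})
  = -\,\mathbb{E}_{(x, y_w, y_l) \sim \mathcal{D}}
    \left[ \log \sigma\!\left(
      \beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\text{ref}}(y_w \mid x)}
      - \beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\text{ref}}(y_l \mid x)}
    \right) \right]

% The "implicit reward": the optimal RLHF policy implies
%   r(x, y) = \beta \log \frac{\pi_\theta(y \mid x)}{\pi_{\text{ref}}(y \mid x)}
%             + \beta \log Z(x),
% and the intractable partition term Z(x) cancels inside the
% Bradley-Terry preference difference, giving the loss above.
```

The trick is that no explicit reward model ever needs to be trained: the policy's own log-probability ratio against the reference acts as the reward.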