Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment • Paper • 2408.06266 • Published Aug 12, 2024
Direct Preference Optimization: Your Language Model is Secretly a Reward Model • Paper • 2305.18290 • Published May 29, 2023
Embedding Model • Collection • Vietnamese Pre-trained Embedding Models • 4 items • Updated Sep 8, 2024