Self-Play Preference Optimization for Language Model Alignment • Paper • 2405.00675 • Published May 1
AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge • Paper • 2412.13670 • Published 6 days ago