javirandor's picture
Create README.md
52f7538 verified
|
raw
history blame contribute delete
No virus
332 Bytes

Poisoned Reward Model

This reward model was used to align this generation model for the trojan detection competition co-located at SaTML 2024. For more information, visit the official competition website