This model is created for the CS329X class HW1, and was trained on 270 self-annotated preference pairs based on PRISM questions.
Base model