MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Paper
•
2407.04842
•
Published
•
52
Note A novel benchmark which incorporates a comprehensive preference dataset to evaluate multimodal judges in providing feedback for image generation models across four key perspectives: Alignment, Safety, Image Quality (Aesthetics and Artifacts), and Bias.