Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation • Paper • 2501.03225 • Published Jan 6