More advanced and challenging multi-task evaluation
A model giving fine-grained scores on video quality