Generate text by combining an image and a question
Select coordinates on an image based on instructions
Upload documents to answer questions