Generate text descriptions from images
Convert text to voice using a musical model
Transform images into pixel art style