Demo of GOT-OCR 2.0's Transformers implementation
Generate speech from text
Generate text responses using images and text prompts