Transcribe or translate audio from files, microphone, or YouTube
Generate and convert speech using text and audio inputs
Generate images from text descriptions