Generate and convert speech using text and audio inputs
Generate audio from text using voice synthesis