Generate images from text descriptions
Generate detailed prompts for Stable Diffusion
Transcribe or translate audio from files, microphone, or YouTube