Detect objects in images or videos
Generate personalized images with a face preservation
Generate realistic audio from text