Extract and crop people, heads, and half-bodies from images
CPU powered, low RTF, emotional, multilingual TTS
Identify emotions in spoken words