F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Translate spoken words to English speech
Generate conversation responses using Mixtral 8x7b model