G2P
Upgraded to v1.0!
An end-to-end (e2e) Voice Language Model by Fish Audio.
Whisper model to transcript japanese audio to katakana.