MinMo: A Multimodal Large Language Model for Seamless Voice Interaction • Paper • 2501.06282 • Published 17 days ago • 40
CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models • Paper • 2412.10117 • Published Dec 13, 2024 • 2
alibaba-damo/audio_codec-freqcodec_magphase-en-libritts-16k-gr8nq32ds320-pytorch • Updated Oct 12, 2023 • 5 • 1
alibaba-damo/audio_codec-freqcodec_magphase-en-libritts-16k-gr1nq32ds320-pytorch • Updated Oct 12, 2023 • 5
PolyLM: An Open Source Polyglot Large Language Model • Paper • 2307.06018 • Published Jul 12, 2023 • 26