Spaces for Audio / Voices
- Running on Zero338π
- Running9π ποΈπ₯°
SBV2 Chupa Demo
- Running2πποΈπ
VisualNovel_sbv_demo
- Running on CPU Upgrade600πποΈ
Moe TTS
- Running5πΊ
Bert-VITS2 AI Abe&Suga&Kishida
- Running30π
AICoverGen
- Build error13:π€
rvc-Blue-archives-hoyogames
- Running38βΆοΈπ€
VTuber RVC Models
- Running313π
RVC Inference HF
- Running on Zero192π
AudioπΉSeparator
Vocal and background audio separator
- Running37π
BlueArchiveTTS
- Running138πππ
Multi Voice TTS(English/Chinese/Japanese)
[δΈζ/English/ζ₯ζ¬θͺ]multilingual text-to-speech
- Running on Zero346π₯
Stable Audio Open Zero
- Running129π
Applio
A simple, high-quality voice conversion tool
- Running on Zero1.4kπ£οΈ
Voice Clone
- Running on Zero137β‘
RVCβ‘ZERO
Voice conversion framework based on VITS
- Running5ππ΄
Multilingual Anime TTS
- Running1πΆ
DiffSingerπΆ Diffusion for Singing Voice Synthesis
- Running116π΅
Ultimate Vocal Remover WebUI
- Running226ππΊ
Aesthetic RVC Inference HF
- Running59β‘
Advanced RVC Inference
- Running756π
Vits Models
- Running487ππ΄
Multilingual Anime TTS
- Running32β‘
LoveLive-ShojoKageki VITS
- Running361π¨
vits-uma-genshin-honkai
- Running3πΊ
γγγγΉγζγγγ‘γΌγ«γΌοΌStyle-Bert-VITS2οΌ
- Running10πβΆοΈ
Hololive Style-Bert-VITS2
- Running on Zero439πΌπΆ
Midi Music Generator
- Running20πΌ
Japanese Lyric Generator
- Running on A10G343π
VALL E X
- Running2π₯
AIζγγγ‘γΌγ«γΌ
- Running6π
BangDream-ShojoKageki Bert VITS2
- Running3π
lovelive-ShojoKageki VITS JPZH
- Running16π
Lovelive-nijigasaki-MB-iSTFT-VITS-ZH&JP
- Running on T42.04kπΆ
Bark
- Running977π€
OpenVoice
- Running265π€
OpenVoiceV2
- Runtime error55π
ChatTTS OpenVoice
- Running on T4169ππ¦
MassivelyMultilingualTTS
- Running on T42.06kπΈ
XTTS
- Running on A10G4.5kπ΅
MusicGen
- Runtime error514π
Seamless M4T v2
- Sleeping60π
Mars5 Space
- Running on Zero8ποΈπΎππ£οΈ
FAcodecV2
- Running7ππ
Lemonfoot GPT-SoVITS
- Running on A10G209π
TTS x Hallo Talking Portrait
Generate Talking avatars from Text-to-Speech
- Running on CPU Upgrade381π€
RVC Genshin Impact
- Runtime error84π
FoleyCrafter
- Running152π
Voice Clone Multilingual
Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
- Running on Zero14π¨
Talkalkai Cover
- Running on Zero435πΊ
Image to Music v2
Get a music sample inspired by the mood of an image
- Running180π
Whisper Timestamped
In-browser speech recognition w/ word-level timestamps
- Running on CPU Upgrade495π
TTS Arena
Vote on the latest TTS models!
- Running17π₯
TTSDS Benchmark and Leaderboard
Text-To-Speech (TTS) Evaluation using objective metrics.
- Sleeping6π¨
LAKH MIDI Dataset Search
Search and explore LAKH MIDI dataset with MidiCaps
- Running on Zero22π
PicoAudio
- Running13π
Advanced MIDI Search
Search and explore 179k+ MIDI titles
- Running on Zero64π
SenseVoice
- Running210π£οΈ
Whisper Speaker Diarization
- Running228π
Faster Whisper Webui
- Running on Zero25π€
Vocal Separation SOTA
- Running73π
BangDream-ShojoKageki Bert VITS2
- Running2π
BangDream-ShojoKageki Api
- Running15π
BangDream-ShojoKageki Bert VITS2
- Running13π
Efficient Audio Captioning
- Running on Zero164π
NaturalSpeech3 FACodec
- Running220π
tts Text To Speech
- Running4π
Edge Tts
- Running13π
JA TTS Arena
Vote on the top Japanese TTS models!
- Running9β‘
MIKU TTS
- Running4πΉ
Genshin music generation
- Sleeping3β‘
Advanced RVC Inference
- Sleepingπ
Style Bert VITS2 MT
- Runtime error3ποΈ
ZeroRVC
- Running7π
Edge TTS w/ More Options
- Running on Zero6π¬
ChatTTS Forge
- Runtime error33β‘
EZ Voice Clone
- Runtime error3β‘
Training Helper Rvc
easy training helper For RVC
- Running on Zero17π
Anitalker
- Running5:π€
rvc-Blue-archives
- Sleeping70π
Fish Diffusion (HiFiSinger) Demo
- Running15π₯°
Japanese Ero Voice Classifier
- Running27πποΈπ
Style Bert VITS2 Editor Demo
- Running on A10G287π
Fish Speech 1
- Running2πΉ->π΅
Piano transcription
- Sleeping1β‘
Rvc Demo
A demo of RVC pip
- Running98πΆ
Bark Voice Cloning
- Sleeping1πΈ
NeonAI Coqui AI TTS Plugin
- Runtime error105πΈ
NeonAI Coqui AI TTS Plugin
- Running138π
Qwen2 Audio Instruct Demo
- Running8π£οΈ
StyleTTS 2
Efficient, fast, and natural text to speech with StyleTTS 2!
- Runtime error12π₯
AICoverGen
- Running10π₯
Harmonic Melody MIDI Mixer
Harmonize and mix any MIDI melody
- Running7π»
MusicGen Riff
Music Generator | Song Maker Free | Lyrics Generator
- Runtime error30π΅
Ilaria Audio Analyzer
- Running on Zero657π»
Ilaria RVC
- Runtime error4π πΏ
MDX UVR
- Running on Zero72π€
GPT SoVITS V2
- Running on Zero7π£οΈ
Read My Pdf Outloud
- Running6β‘
Vocal Remover
- Running on Zero735π₯
Parler-TTS
High-fidelity Text-To-Speech
- Runtime error3π₯°
Japanese Ero Voice Classifier
- Running3π
GPT-SoVITS-ToneControl_test
- Running18π
Umamusume Bert Vits2
- Sleeping1π
Animalese Py
- Sleeping2πΆ
Animalese RVC
- Build error4π
AI Hanser
- Running on Zero155π»
Stable Audio Live Multiplayer
- Running339π
Edge TTS Text To Speech
- Running6π¨
Youtube AI Summarizer
- Sleeping3π
AICoverGen
- Running1π»
Animalese Js
- Running on CPU Upgrade1π¬
ASR Model Comparison
- Running3π₯
AICoverGenMod
- Sleeping1π¨
Ilaria Converter
- Sleeping1π
RVC UI TES
- Running8π€
RVC Genshin Impact
- Sleeping1π¦
Voice2VoiceChatbot
- Sleepingπ
RealTimeVoicetoVoiceChatbot
sp-uhh/speech-enhancement-sgmse
Audio-to-Audio β’ Updated β’ 71 β’ 7- Running2π
RVC UI
An easy-to-use voice conversion framework based on VITS.
- Sleepingπ
RVC
- Runningπ
AI Voice Assistance
- Running on Zero1π£οΈ
Voice Clone
- Sleeping5π
Optimus
- Running37π
Doc To Dialogue
Transform a report or document into an interview/discussion
- Running44β‘
Voicee
World's fastest Voice Assistant
- Running6π
Fish Audio API Demo
- Running on Zero55π
Musicgen Songstarter Demo
- Running72βΆοΈπ»πΏ
Hololive Rvc Models V2
- Running22πΌπΆ
Advanced MIDI Renderer
Transform and render any MIDI
- Running3π
Imagen POP Music Medley Diffusion Transformer
Generate POP music medley with Imagen diffusion transformer
- Sleeping2π₯
Ultimate MIDI Classifier
Classify absolutely any MIDI by genre, song and artist
- Running on Zero2π
Intelligent MIDI Comparator
Intelligently compare any pair of MIDIs
- Running85π
ChatTTS Speaker
- Sleeping2π
Bridge Music Transformer
Generate a seamless bridge between two composition parts
- Running55π
vits-simple-api
- Running10ποΈ
Bert VITS Umamusume Genshin HonkaiSR
- Running on Zero24πβ«
Audio SR
Fixed fork of the original audio sr!
- Running on Zero109π€π
Seed Voice Conversion
- Running39β‘
Mini Omni
- Running4β‘
Monophonic MIDI Melody Harmonizer
Retrieval augmented harmonization of any MIDI melody
- Running10β‘
MIDI Melody
Add a unique melody to any MIDI file
- Running3π₯
MIDI Chords Mixer
Mix chords from one MIDI to another MIDI
- Sleeping3π
Morse To Audio
- Sleeping1π
RCV EASY GUI
- Sleeping1β‘
Advanced RVC Inference
- Running2β‘
Lyricsgenius
Get Lyrics from Genius's Link
- Sleeping1π
Groq Gradio Voice Assistant
- Sleeping2π
Hex Separator
- Sleeping2π
Groq API Models
Groq API Playground
- Running on Zero8π
GPT-SoVITS-V2-NIIMI SORA
- Running on A10G2π΅
AI Tube Engine MusicGen
- Running on A10G1π΅
AI Tube Engine MusicGen
- Sleeping1π΅
AI Tube Engine MusicGen
- Running on A10G5π΅
AI Tube Engine MusicGen
- Running on Zero13π
GPT-SoVITS-V2-Gakuen Idolmaster
- Running on Zero6π
UTMOSv2
- Runtime error5β‘
Mini Omni
- Running on Zero4π
GPT-SoVITS-V2-misc_models
- Configuration error12π
Bench.audio
LMSYS bench for audio agents
- Runtime error77π
Compressed Wav2Lip
- Running76π
Gradio Lipsync Wav2lip
- Running on Zero5π¨
EchoMimic
- Running3π»
RVC GUI
- Running20π
Wav2lip Gpu
- Running1π
Matcha TTS Japanese
Description of Matcha TTS Japanese
- Running82π©
DeepFilterNet2
- Running on Zero12π«π·π₯
French Parler-TTS
High-fidelity Text-To-Speech
- Running on Zero245π£
EzAudio
- Running on T413π₯
Kotoba Whisper Demo
- Running1π¦
Matcha Tts Onnx Benchmarks
Benchmark load model and tts time
- Sleeping7β‘
Mini Omni
- Running on Zero2π
AIChat-matcha-tts-onnx-en
Give your space a voice! (Demo)
- Running on Zero11π
GAMA
- Running on Zero3π
GAMA-IT
- Sleeping1π¦
Sbv2 Py
- Running on Zero205πΆ
OpenMusic
- Running59ποΈ
PodcastGen
Generate a 2-speaker podcast from text input or documents!
- Running3π
Mistral 7B Instruct v0.3 Matcha-TTS English
Enjoy TTS Chat
- Sleeping2π¨
Moshi
- Running on Zero41π£
EzAudio ControlNet
- Running3π
Fish Audio API Demo
- Runtime error1π
Whisper En Tiny
- Running on Zero7π
Guided Rock Music Transformer
Controlled source augmented rock music transformer
- Running on Zero18π·
Long-form MusicGen
Long-form Musicgen
- Running68π»
Multilingual TTS
- Running3π₯
AIε²Έη°ζιγ‘γΌγ«γΌ
- Running1π₯
AIθ ηΎ©εγ‘γΌγ«γΌ
- Runtime error1π»
Audio Mouth
- Runtime error372π
Pdf2audio
- Running on CPU Upgrade505π
Open ASR Leaderboard
- Running on T4921ποΈ
Open NotebookLM
Personalised Podcasts For All - Available in 13 Languages
- Running on Zero4π₯
Kotoba Whisper Bilingual Demo
- Running on T4386π£οΈ
MeloTTS
Fast, efficient, & multilingual text-to-speech
- Sleeping173π€
Canary 1b
- Running1π»
Style Bert VITS2 SW
- Runtime error21π
Llama 3.2 3b Voice
- Runtime error1π
Pdf2audio
- Running on Zero585π€―
Whisper Turbo
- Running on Zero254π€―
Realtime Whisper Turbo
Realtime implementation of Whisper large turbo
- Running116π
Whisper Large V3 Turbo WebGPU
ML-powered speech recognition directly in your browser
- Running on T4238π’
Tortoise Tts
ExpressivText-to-Speech
- Running29π»
Russian Text To Speech
- Sleeping5π
Yt-dlp Wav
- Running on T4268πΌ
UnlimitedMusicGen
unlimited Audio generation with a few added features
- Runtime error84πΆ
AudioCraft Plus v2.0.0a (MusicGen + AudioGen)
- Runtime error22πΌ
MusicGen+ V1.2.7 (HuggingFace Version)
- Running on Zero57π’
VoiceRestore
- Running on Zero3β‘
Whisperturbo
whisper3 turbo
- Running24ποΈ
GPT-SoVITS-3s-cloning-free-TTS
- Running2πΊ
γγγγΉγη³η ΄θγ‘γΌγ«γΌοΌStyle-Bert-VITS2οΌ
- Running1πΊ
γγγγΉγδΊιδΏεγ‘γΌγ«γΌ
- Runtime error3π
Text To Meow
- Runtime error4π₯
Rvc Ui
- Running23π
Reverb ASR Demo
- Running on Zero6π
Diva Audio
- Sleeping1π»
Ilaria RVC Mod
- Running on T4281π
Resemble Enhance
- Running1π»
Openai Whisper Large V3 Turbo
- Running42π»
RVC PlayGround
- Running36π
Podcastfy.ai - An Open Source alternative to NotebookLM's podcast feature
- Running on Zero64ποΈπΊ
Video to Music
Generate and apply matching music background to video shot
- Running165πποΈ
Video SoundFX
Generates a sound effect that matches video shot
- Paused172π
Image2SFX Comparison
Generates audio environment from an image
- Running on Zero163π
Applio
- Running on Zero1.17kπ£οΈ
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- Running1π
Heartbeat
- Running90π€π
TTS Spaces Arena
Vote on the top HF TTS models!
- Running on CPU Upgrade61π§ββοΈπ§ββοΈπ§ββοΈ
xVASynth TTS
CPU powered, low RTF, emotional, multilingual TTS
- Running275πΆ
β AI Jukebox β
Generate music powered by AI
- Running on L40S294π
TANGO
Co-Speech Gesture Video Generation
- Running on Zero58π’
Ichigo Llama3.1 S Instruct
- Running on Zero6π
Whisper Japanese Phone Demo
Whisper model to transcript japanese audio to katakana.
- Running19π₯π
CoverGen
- Running20β«π
Audio Steganography
- Running14π₯
AICoverGenMod
- Running8π
UVR UI
- Running on Zero15π£οΈ
Diva Realtime Chat
- Sleeping2π
Kotoba Whisper Diarization Demo
- Running on Zero6π
Synthio Stable Audio Open
Stable audio open model from Synthio paper.
- Running1π
RYO EVC
- Runtime error1π»
UVR
- Running on Zero32π
Moonshine ASR
Fast & efficient ASR outperforming Whisper!
- Runtime error18π
seewav-gui
- Running on Zero69π΅
RWKV Music
Generate MIDI music using RWKV v4!
- Running3π»
MP3 Transcribe
Whisper Transcribe MP3 files, use a GPU to convert faster!
- Running2π£οΈ0οΈβ£
StyleTTS 2 Zero
Efficient, fast, and natural text to speech with StyleTTS 2!
- Running on Zero215π»
MaskGCT TTS Demo
MaskGCT TTS Demo
- Running on Zero533π€«
Whisper Large V3
- Running on Zero3π
Ultimate Chords Progressions Transformer
Self-correcting multi-instrumental chords transformer
- Runtime error8πΆβ«
Chords Progressions Transformer
Chords-conditioned music transformer
- Running on Zero14β‘
Fast Whisper Turbo
Ultra-fast Whisper Turbo inference β‘
- Sleeping284π
AudioLDM2 Text2Audio Text2Music Generation
- Running2π£οΈπ
Hey Buddy!
In-Browser Audio Wake-Word Spotting
- Running3πΉ
Streamlit Pianoroll
Streamlit pianoroll playback element
- Running5β‘
PolUVR
Audio-Separator by Politrees
- Running on Zero55π
Giant Music Transformer
Fast multi-instrumental music transformer
- Sleeping17π
Omni Mini (WebRTC)
- Running5πΉ
Fortepyan Datasets
Streamlit browser for piano music datasets.
- Running4πΉ
PIANO Dataset
Demo of masking tasks from the PIANO dataset
- Running on L40S91π¬
Fish Agent
An end-to-end (e2e) Voice Language Model by Fish Audio.
- Running5π΅
Audio to Stems to MIDI Converter
- Running on CPU Upgrade11π
Podcast Generation
Generate podcasts with AI avatars
- Sleepingπ
ChatTTS OpenVoice
- Running1π
OpenVoice
- Running on Zero4π£οΈ
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- Running309π
Bark with Voice Cloning
- Running11π
OuteTTS 0.1 350M Demo
- Running on Zero2πΌπΆ
Midi Music Generator
- Running1π΅
Audio Lyrics Extractor