fish-diffusion / config.yaml
lengyue233's picture
Add ALYS + voice provider credits (#6)
89fac6a
readme: |
# Fish Diffusion - HiFiSinger Demo 🎀
GitHub Repo: [fishaudio/fish-diffusion](https://github.com/fishaudio/fish-diffusion)
To share a new model, please check out the [Share Your Model](https://huggingface.co/spaces/fishaudio/fish-diffusion/discussions/2) discussion.
max_mixing_speakers: 3
models:
- name: "M4Singer Pretrained (Many Speakers, Alto, Tenor, Soprano, Bass)"
config: configs/M4Singer.py
checkpoint: checkpoints/M4Singer.ckpt
readme: |
This model is trained on the Opencpop and M4Singer dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It contains more than 20 speakers and is thus a good playground for timbre mixing.
default_speaker: "opencpop"
- name: "Tohoku Kiritan (Feminine)"
config: configs/Kiritan.py
checkpoint: checkpoints/Kiritan.ckpt
readme: |
This model is trained on the Tohoku Kiritan dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It has a cute, yet powerful voice. CV: Akaneya Himika
default_speaker: "kiritan"
- name: "Tohoku Itako (Feminine)"
config: configs/Itako.py
checkpoint: checkpoints/Itako.ckpt
readme: |
This model is trained on the Tohoku Itako dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It has a bright and whispery voice. CV: Kido Ibuki
default_speaker: "itako"
- name: "No.7 (Feminine)"
config: configs/Seven.py
checkpoint: checkpoints/Seven.ckpt
readme: |
This model is trained on the No.7 dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It has a strong and sharp voice. CV: Koiwai Kotori
default_speaker: "seven"
- name: "Yoko (Feminine)"
config: configs/Yoko.py
checkpoint: checkpoints/Yoko.ckpt
readme: |
This model is trained on the Sinsy-f00001 dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It has a powerful, tense, and relaxed voice.
default_speaker: "yoko"
- name: "JSUT (Feminine)"
config: configs/JSUT.py
checkpoint: checkpoints/JSUT.ckpt
readme: |
This model is trained on the JSUT-song dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It has a moist and transparent voice.
default_speaker: "jsut"
- name: "CSD (Feminine)"
config: configs/CSD.py
checkpoint: checkpoints/CSD.ckpt
readme: |
This model is trained on the Children's Song Dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It has a sweet and tender voice.
default_speaker: "csd"
- name: "Namine Ritsu (Feminine)"
config: configs/Ritsu.py
checkpoint: checkpoints/Ritsu.ckpt
readme: |
This model is trained on the Namine Ritsu ENUNU Dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It has a powerful and throaty voice. CV: Canon
default_speaker: "ritsu"
- name: "S (Masculine)"
config: configs/S.py
checkpoint: checkpoints/S.ckpt
readme: |
This model is trained on a datset known as S and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It has a nasally but powerful voice.
default_speaker: "s"
- name: "C (Feminine)"
config: configs/C.py
checkpoint: checkpoints/C.ckpt
readme: |
This model is trained on a datset known as C and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It has a whispery, fluttery voice.
default_speaker: "c"
- name: "Azure Cobalt (Feminine)"
config: configs/Azure.py
checkpoint: checkpoints/Azure.ckpt
readme: |
This model is trained on a dataset known as Azure Cobalt and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.
It has a stable, mature voice. CV: Aster
default_speaker: "azure"
- name: "ALYS (Feminine)"
config: configs/ALYS.py
checkpoint: checkpoints/ALYS.ckpt
readme: |
This model is trained on the ALYS DB 001 JPN dataset, originally produced by Voxwave and released under the [GPL-3.0](https://choosealicense.com/licenses/gpl-3.0/) license.
It has a slightly soft voice. CV: Poucet
default_speaker: "ALYS"