Pendrokar/xvapitch_nvidia

Pendrokar committed on May 18, 2024

Commit a1a712f · verified · 1 Parent(s): ff01c29

Link to dataset

Files changed (1)

README.md +8 -7

@@ -29,15 +29,16 @@ language:
 - sw
 - yo
 - wo
-thumbnail: >-
-  https://raw.githubusercontent.com/DanRuta/xVA-Synth/master/assets/x-icon.png
+thumbnail: https://raw.githubusercontent.com/DanRuta/xVA-Synth/master/assets/x-icon.png
 library: xvasynth
 tags:
-  - emotion
-  - audio
-  - text-to-speech
-  - tts
+- emotion
+- audio
+- text-to-speech
+- tts
 pipeline_tag: text-to-speech
+datasets:
+- MikhailT/hifi-tts
 ---
 xVASynth's xVAPitch (v3) type of voice models based on NVIDIA HIFI NeMo datasets.
@@ -54,4 +55,4 @@ xVAPitch model referenced Papers:
 - SDP - https://arxiv.org/pdf/2106.06103.pdf
 - Spline Flow - https://arxiv.org/abs/1906.04032
-Legal note: Although these datasets are licensed as CC BY 4.0, the base v3 model that these are fine-tuned from, was pre-trained on non-permissive data.
+Legal note: Although these datasets are licensed as CC BY 4.0, the base v3 model that these models are fine-tuned from, was pre-trained on non-permissive data.