README.md CHANGED
@@ -29,15 +29,16 @@ language:
 - sw
 - yo
 - wo
-thumbnail:
-https://raw.githubusercontent.com/DanRuta/xVA-Synth/master/assets/x-icon.png
+thumbnail: https://raw.githubusercontent.com/DanRuta/xVA-Synth/master/assets/x-icon.png
 library: xvasynth
 tags:
-
-
-
-
+- emotion
+- audio
+- text-to-speech
+- tts
 pipeline_tag: text-to-speech
+datasets:
+- MikhailT/hifi-tts
 ---

 xVASynth's xVAPitch (v3) type of voice models based on NVIDIA HIFI NeMo datasets.
@@ -54,4 +55,4 @@ xVAPitch model referenced Papers:
 - SDP - https://arxiv.org/pdf/2106.06103.pdf
 - Spline Flow - https://arxiv.org/abs/1906.04032

-Legal note: Although these datasets are licensed as CC BY 4.0, the base v3 model that these are fine-tuned from, was pre-trained on non-permissive data.
+Legal note: Although these datasets are licensed as CC BY 4.0, the base v3 model that these models are fine-tuned from, was pre-trained on non-permissive data.