Pendrokar commited on
Commit
a1a712f
·
verified ·
1 Parent(s): ff01c29

Link to dataset

Browse files
Files changed (1) hide show
  1. README.md +8 -7
README.md CHANGED
@@ -29,15 +29,16 @@ language:
29
  - sw
30
  - yo
31
  - wo
32
- thumbnail: >-
33
- https://raw.githubusercontent.com/DanRuta/xVA-Synth/master/assets/x-icon.png
34
  library: xvasynth
35
  tags:
36
- - emotion
37
- - audio
38
- - text-to-speech
39
- - tts
40
  pipeline_tag: text-to-speech
 
 
41
  ---
42
 
43
  xVASynth's xVAPitch (v3) type of voice models based on NVIDIA HIFI NeMo datasets.
@@ -54,4 +55,4 @@ xVAPitch model referenced Papers:
54
  - SDP - https://arxiv.org/pdf/2106.06103.pdf
55
  - Spline Flow - https://arxiv.org/abs/1906.04032
56
 
57
- Legal note: Although these datasets are licensed as CC BY 4.0, the base v3 model that these are fine-tuned from, was pre-trained on non-permissive data.
 
29
  - sw
30
  - yo
31
  - wo
32
+ thumbnail: https://raw.githubusercontent.com/DanRuta/xVA-Synth/master/assets/x-icon.png
 
33
  library: xvasynth
34
  tags:
35
+ - emotion
36
+ - audio
37
+ - text-to-speech
38
+ - tts
39
  pipeline_tag: text-to-speech
40
+ datasets:
41
+ - MikhailT/hifi-tts
42
  ---
43
 
44
  xVASynth's xVAPitch (v3) type of voice models based on NVIDIA HIFI NeMo datasets.
 
55
  - SDP - https://arxiv.org/pdf/2106.06103.pdf
56
  - Spline Flow - https://arxiv.org/abs/1906.04032
57
 
58
+ Legal note: Although these datasets are licensed as CC BY 4.0, the base v3 model that these models are fine-tuned from, was pre-trained on non-permissive data.