Update README.md
Browse files
README.md
CHANGED
@@ -17,7 +17,26 @@ Source code for phoneme alignment: https://github.com/sinhprous/F5-TTS/blob/main
|
|
17 |
Source code for duration predictor: https://github.com/sinhprous/F5-TTS/blob/main/src/f5_tts/model/duration_predictor.py
|
18 |
|
19 |
## Audio samples
|
|
|
20 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
21 |
|
22 |
## License
|
23 |
This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution
|
|
|
17 |
Source code for duration predictor: https://github.com/sinhprous/F5-TTS/blob/main/src/f5_tts/model/duration_predictor.py
|
18 |
|
19 |
## Audio samples
|
20 |
+
Outputs from original model was generated using https://huggingface.co/spaces/mrfakename/E2-F5-TTS
|
21 |
|
22 |
+
Data - driven AI systems said, "Key data is the key, data is key, data is key, data is the key, and the key to the data is key, the data key is the key to the data that is key to the key". Can you keep up?
|
23 |
+
|
24 |
+
Original model: (skipping words)
|
25 |
+
<audio controls>
|
26 |
+
<source src="https://huggingface.co/sinhprous/F5TTS-stabilized-LJSpeech/blob/main/audio_samples/sample_origin_1.wav" type="audio/mp3">
|
27 |
+
Your browser does not support the audio element.
|
28 |
+
</audio>
|
29 |
+
|
30 |
+
Finetuned model:
|
31 |
+
<audio controls>
|
32 |
+
<source src="https://huggingface.co/sinhprous/F5TTS-stabilized-LJSpeech/blob/main/audio_samples/sample_aligned_1.wav" type="audio/mp3">
|
33 |
+
Your browser does not support the audio element.
|
34 |
+
</audio>
|
35 |
+
|
36 |
+
Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo.
|
37 |
+
|
38 |
+
|
39 |
+
Call one two three - one two three - one two three four who call one two three - one two three - one two three four who call one two three - one two three - one two three four who call one two three - one two three - one two three four.
|
40 |
|
41 |
## License
|
42 |
This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution
|