Update README.md
Browse files
README.md
CHANGED
@@ -18,10 +18,11 @@ Source code for duration predictor: https://github.com/sinhprous/F5-TTS/blob/mai
|
|
18 |
|
19 |
## Audio samples
|
20 |
Outputs from original model was generated using https://huggingface.co/spaces/mrfakename/E2-F5-TTS
|
|
|
21 |
|
22 |
-
Data - driven AI systems said, "Key data is the key, data is key, data is key, data is the key, and the key to the data is key, the data key is the key to the data that is key to the key". Can you keep up?
|
23 |
|
24 |
-
Original model:
|
25 |
<audio controls>
|
26 |
<source src="https://huggingface.co/sinhprous/F5TTS-stabilized-LJSpeech/resolve/main/audio_samples/sample_origin_1.wav" type="audio/mp3">
|
27 |
Your browser does not support the audio element.
|
@@ -33,10 +34,35 @@ Finetuned model:
|
|
33 |
Your browser does not support the audio element.
|
34 |
</audio>
|
35 |
|
36 |
-
Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo.
|
37 |
|
|
|
38 |
|
39 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
40 |
|
41 |
## License
|
42 |
This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution
|
|
|
18 |
|
19 |
## Audio samples
|
20 |
Outputs from original model was generated using https://huggingface.co/spaces/mrfakename/E2-F5-TTS
|
21 |
+
The original model usually skips words in these hard texts..
|
22 |
|
23 |
+
*Data - driven AI systems said, "Key data is the key, data is key, data is key, data is the key, and the key to the data is key, the data key is the key to the data that is key to the key". Can you keep up? *
|
24 |
|
25 |
+
Original model:
|
26 |
<audio controls>
|
27 |
<source src="https://huggingface.co/sinhprous/F5TTS-stabilized-LJSpeech/resolve/main/audio_samples/sample_origin_1.wav" type="audio/mp3">
|
28 |
Your browser does not support the audio element.
|
|
|
34 |
Your browser does not support the audio element.
|
35 |
</audio>
|
36 |
|
|
|
37 |
|
38 |
+
*Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo.*
|
39 |
|
40 |
+
Original model:
|
41 |
+
<audio controls>
|
42 |
+
<source src="https://huggingface.co/sinhprous/F5TTS-stabilized-LJSpeech/resolve/main/audio_samples/sample_origin_2.wav" type="audio/mp3">
|
43 |
+
Your browser does not support the audio element.
|
44 |
+
</audio>
|
45 |
+
|
46 |
+
Finetuned model:
|
47 |
+
<audio controls>
|
48 |
+
<source src="https://huggingface.co/sinhprous/F5TTS-stabilized-LJSpeech/resolve/main/audio_samples/sample_aligned_2.wav" type="audio/mp3">
|
49 |
+
Your browser does not support the audio element.
|
50 |
+
</audio>
|
51 |
+
|
52 |
+
|
53 |
+
*Call one two three - one two three - one two three four who call one two three - one two three - one two three four who call one two three - one two three - one two three four who call one two three - one two three - one two three four.*
|
54 |
+
|
55 |
+
Original model:
|
56 |
+
<audio controls>
|
57 |
+
<source src="https://huggingface.co/sinhprous/F5TTS-stabilized-LJSpeech/resolve/main/audio_samples/sample_origin_3.wav" type="audio/mp3">
|
58 |
+
Your browser does not support the audio element.
|
59 |
+
</audio>
|
60 |
+
|
61 |
+
Finetuned model:
|
62 |
+
<audio controls>
|
63 |
+
<source src="https://huggingface.co/sinhprous/F5TTS-stabilized-LJSpeech/resolve/main/audio_samples/sample_aligned_3.wav" type="audio/mp3">
|
64 |
+
Your browser does not support the audio element.
|
65 |
+
</audio>
|
66 |
|
67 |
## License
|
68 |
This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution
|