Commit 07f1864 · 1 Parent(s): 1700872

style

app.py CHANGED
@@ -325,16 +325,15 @@ with gr.Blocks(css=css) as block:
 <p><a href="https://github.com/huggingface/parler-tts"> Parler-TTS</a> is a training and inference library for
 high-fidelity text-to-speech (TTS) models. Two models are demonstrated here, <a href="https://huggingface.co/parler-tts/parler_tts_mini_v0.1"> Parler-TTS Mini v0.1</a>,
 is the first iteration model trained using 10k hours of narrated audiobooks, and <a href="https://huggingface.co/ylacombe/parler-tts-mini-jenny-30H"> Parler-TTS Jenny</a>,
-a model fine-tuned on the <a href="https://huggingface.co/datasets/reach-vb/jenny_tts_dataset"> Jenny dataset</a
-
-<p>Both models generates high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).</p>
+a model fine-tuned on the <a href="https://huggingface.co/datasets/reach-vb/jenny_tts_dataset"> Jenny dataset</a>.
+Both models generates high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).</p>
 
 <p>Tips for ensuring good generation:
 <ul>
 <li>Include the term "very clear audio" to generate the highest quality audio, and "very noisy audio" for high levels of background noise</li>
+<li>When using the fine-tuned model, include the term "Jenny" to pick out her voice</li>
 <li>Punctuation can be used to control the prosody of the generations, e.g. use commas to add small breaks in speech</li>
 <li>The remaining speech features (gender, speaking rate, pitch and reverberation) can be controlled directly through the prompt</li>
-<li>Include the term "Jenny" when using the fine-tuned Jenny model to pick out her voice</li>
 </ul>
 </p>
 """
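
The prompt tips in the changed text can be illustrated with a small sketch. The helper below is hypothetical: `build_description` and its parameters are not part of app.py or the Parler-TTS library. It only assembles a description string that follows the tips (the quality phrase and the "Jenny" keyword), on the assumption that such a string is what gets typed into the demo's description box.

```python
def build_description(delivery: str, *, noisy: bool = False, use_jenny: bool = False) -> str:
    """Compose a description prompt following the demo's tips (hypothetical helper)."""
    # Tip: include the term "Jenny" when targeting the fine-tuned model.
    subject = "Jenny" if use_jenny else "A speaker"
    # Tip: "very clear audio" for the highest quality,
    # "very noisy audio" for high levels of background noise.
    quality = "very noisy audio" if noisy else "very clear audio"
    # The remaining features (rate, pitch, reverberation) are passed as free text.
    return f"{subject} {delivery.strip()}, with {quality}."


if __name__ == "__main__":
    print(build_description("speaks at a slightly fast pace with a low pitch", use_jenny=True))
```

The subject and quality phrase are kept as separate arguments so that the two keyword tips are hard to forget, while the delivery text stays free-form, matching the note that the other speech features are controlled directly through the prompt.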