sanchit-gandhi commited on
Commit
07f1864
·
1 Parent(s): 1700872
Files changed (1) hide show
  1. app.py +3 -4
app.py CHANGED
@@ -325,16 +325,15 @@ with gr.Blocks(css=css) as block:
325
  <p><a href="https://github.com/huggingface/parler-tts"> Parler-TTS</a> is a training and inference library for
326
  high-fidelity text-to-speech (TTS) models. Two models are demonstrated here, <a href="https://huggingface.co/parler-tts/parler_tts_mini_v0.1"> Parler-TTS Mini v0.1</a>,
327
  is the first iteration model trained using 10k hours of narrated audiobooks, and <a href="https://huggingface.co/ylacombe/parler-tts-mini-jenny-30H"> Parler-TTS Jenny</a>,
328
- a model fine-tuned on the <a href="https://huggingface.co/datasets/reach-vb/jenny_tts_dataset"> Jenny dataset</a>.</p>
329
-
330
- <p>Both models generates high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).</p>
331
 
332
  <p>Tips for ensuring good generation:
333
  <ul>
334
  <li>Include the term "very clear audio" to generate the highest quality audio, and "very noisy audio" for high levels of background noise</li>
 
335
  <li>Punctuation can be used to control the prosody of the generations, e.g. use commas to add small breaks in speech</li>
336
  <li>The remaining speech features (gender, speaking rate, pitch and reverberation) can be controlled directly through the prompt</li>
337
- <li>Include the term "Jenny" when using the fine-tuned Jenny model to pick out her voice</li>
338
  </ul>
339
  </p>
340
  """
 
325
  <p><a href="https://github.com/huggingface/parler-tts"> Parler-TTS</a> is a training and inference library for
326
  high-fidelity text-to-speech (TTS) models. Two models are demonstrated here, <a href="https://huggingface.co/parler-tts/parler_tts_mini_v0.1"> Parler-TTS Mini v0.1</a>,
327
  is the first iteration model trained using 10k hours of narrated audiobooks, and <a href="https://huggingface.co/ylacombe/parler-tts-mini-jenny-30H"> Parler-TTS Jenny</a>,
328
+ a model fine-tuned on the <a href="https://huggingface.co/datasets/reach-vb/jenny_tts_dataset"> Jenny dataset</a>.
329
+ Both models generates high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).</p>
 
330
 
331
  <p>Tips for ensuring good generation:
332
  <ul>
333
  <li>Include the term "very clear audio" to generate the highest quality audio, and "very noisy audio" for high levels of background noise</li>
334
+ <li>When using the fine-tuned model, include the term "Jenny" to pick out her voice</li>
335
  <li>Punctuation can be used to control the prosody of the generations, e.g. use commas to add small breaks in speech</li>
336
  <li>The remaining speech features (gender, speaking rate, pitch and reverberation) can be controlled directly through the prompt</li>
 
337
  </ul>
338
  </p>
339
  """