Spaces:
Running
on
Zero
Running
on
Zero
Update app.py
Browse files
app.py
CHANGED
@@ -24,9 +24,8 @@ State of the art text-to-speech model [[model]](https://huggingface.co/collectio
|
|
24 |
## Unleashed
|
25 |
Use this space to generate long-form speech up to around ~2 minutes in length. To generate an unlimited length, clone this space and run it locally.
|
26 |
### Tips
|
27 |
-
|
28 |
- When providing prefix audio, include the text of the prefix audio in your speech text to ensure a smooth transition.
|
29 |
-
- The cleaner the speaker audio, the better the speaker conditioning will be - however, speaker audio is only sampled at 16kHz, so you do not need to provide high-bitrate speaker audio. Unlike this, however, prefix audio should be high-quality, as it is sampled at the full 44.1kHz.
|
30 |
- The appropriate range of Speaking Rate and Pitch STD are highly dependent on the speaker audio. Start with the defaults and adjust as needed.
|
31 |
- Emotion sliders do not completely function intuitively, and require some experimentation to get the desired effect.
|
32 |
""".strip()
|
|
|
24 |
## Unleashed
|
25 |
Use this space to generate long-form speech up to around ~2 minutes in length. To generate an unlimited length, clone this space and run it locally.
|
26 |
### Tips
|
27 |
+
|
28 |
- When providing prefix audio, include the text of the prefix audio in your speech text to ensure a smooth transition.
|
|
|
29 |
- The appropriate range of Speaking Rate and Pitch STD are highly dependent on the speaker audio. Start with the defaults and adjust as needed.
|
30 |
- Emotion sliders do not completely function intuitively, and require some experimentation to get the desired effect.
|
31 |
""".strip()
|