Update README.md
Browse files
README.md
CHANGED
@@ -45,10 +45,10 @@ If you find this model useful, please cite our paper:
|
|
45 |
This is a TTS variant of the SpeechT5 model,
|
46 |
i.e., the input modality is text (processed via the text pre-net),
|
47 |
and the output modality is speech (processed via the speech post-net).
|
48 |
-
The model has the same format as the [
|
49 |
|
50 |
In order to use this model for text-to-speech, it must be [fine-tuned](https://huggingface.co/learn/audio-course/chapter6/fine-tuning) on labeled TTS data.
|
51 |
|
52 |
-
The usage is the same as for the [
|
53 |
|
54 |
## Related works
|
|
|
45 |
This is a TTS variant of the SpeechT5 model,
|
46 |
i.e., the input modality is text (processed via the text pre-net),
|
47 |
and the output modality is speech (processed via the speech post-net).
|
48 |
+
The model has the same format as the [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts) compatible with the [SpeechT5ForTextToSpeech class](https://huggingface.co/docs/transformers/model_doc/speecht5#transformers.SpeechT5ForTextToSpeech).
|
49 |
|
50 |
In order to use this model for text-to-speech, it must be [fine-tuned](https://huggingface.co/learn/audio-course/chapter6/fine-tuning) on labeled TTS data.
|
51 |
|
52 |
+
The usage is the same as for the [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts)
|
53 |
|
54 |
## Related works
|