fav-kky
/

SpeechT5-base-cs-tts

Inference Endpoints

Model card Files Files and versions Community

jlehecka commited on Jul 25, 2024

Commit

65f2331

·

verified ·

1 Parent(s): 8c26849

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -45,10 +45,10 @@ If you find this model useful, please cite our paper:
 This is a TTS variant of the SpeechT5 model,
 i.e., the input modality is text (processed via the text pre-net),
 and the output modality is speech (processed via the speech post-net).
-The model has the same format as the [English speecht5-tts model](https://huggingface.co/microsoft/speecht5_tts) compatible with the [SpeechT5ForTextToSpeech class](https://huggingface.co/docs/transformers/model_doc/speecht5#transformers.SpeechT5ForTextToSpeech).
 In order to use this model for text-to-speech, it must be [fine-tuned](https://huggingface.co/learn/audio-course/chapter6/fine-tuning) on labeled TTS data.
-The usage is the same as for the [English speecht5-tts model](https://huggingface.co/microsoft/speecht5_tts)
 ## Related works

 This is a TTS variant of the SpeechT5 model,
 i.e., the input modality is text (processed via the text pre-net),
 and the output modality is speech (processed via the speech post-net).
+The model has the same format as the [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts) compatible with the [SpeechT5ForTextToSpeech class](https://huggingface.co/docs/transformers/model_doc/speecht5#transformers.SpeechT5ForTextToSpeech).
 In order to use this model for text-to-speech, it must be [fine-tuned](https://huggingface.co/learn/audio-course/chapter6/fine-tuning) on labeled TTS data.
+The usage is the same as for the [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts)
 ## Related works