Adel-Moumen
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -41,7 +41,7 @@ The following is an example of converting text-to-speech with the speaker voice
|
|
41 |
**Note:**
|
42 |
- The model generates speech at a rate of 22050 Hz, but it's important to note that the input signal, crucial for capturing speaker identities, must be sampled at 16kHz.
|
43 |
|
44 |
-
```
|
45 |
import torchaudio
|
46 |
from speechbrain.inference.TTS import MSTacotron2
|
47 |
from speechbrain.inference.vocoders import HIFIGAN
|
@@ -66,7 +66,7 @@ torchaudio.save("synthesized_sample.wav", waveforms.squeeze(1).cpu(), 22050)
|
|
66 |
|
67 |
If you want to generate a random voice, you can use the following:
|
68 |
|
69 |
-
```
|
70 |
import torchaudio
|
71 |
from speechbrain.inference.TTS import MSTacotron2
|
72 |
from speechbrain.inference.vocoders import HIFIGAN
|
@@ -92,7 +92,7 @@ torchaudio.save("synthesized_sample.wav", waveforms.squeeze(1).cpu(), 22050)
|
|
92 |
If you want to generate multiple sentences in one-shot, you can do it this way:
|
93 |
Note: The model internally reorders the input texts in the decreasing order of their lengths.
|
94 |
|
95 |
-
```
|
96 |
import torchaudio
|
97 |
from speechbrain.inference.TTS import MSTacotron2
|
98 |
from speechbrain.inference.vocoders import HIFIGAN
|
|
|
41 |
**Note:**
|
42 |
- The model generates speech at a rate of 22050 Hz, but it's important to note that the input signal, crucial for capturing speaker identities, must be sampled at 16kHz.
|
43 |
|
44 |
+
```python
|
45 |
import torchaudio
|
46 |
from speechbrain.inference.TTS import MSTacotron2
|
47 |
from speechbrain.inference.vocoders import HIFIGAN
|
|
|
66 |
|
67 |
If you want to generate a random voice, you can use the following:
|
68 |
|
69 |
+
```python
|
70 |
import torchaudio
|
71 |
from speechbrain.inference.TTS import MSTacotron2
|
72 |
from speechbrain.inference.vocoders import HIFIGAN
|
|
|
92 |
If you want to generate multiple sentences in one-shot, you can do it this way:
|
93 |
Note: The model internally reorders the input texts in the decreasing order of their lengths.
|
94 |
|
95 |
+
```python
|
96 |
import torchaudio
|
97 |
from speechbrain.inference.TTS import MSTacotron2
|
98 |
from speechbrain.inference.vocoders import HIFIGAN
|