Update README.md
Browse files
README.md
CHANGED
@@ -11,10 +11,12 @@ tags:
|
|
11 |
### How to use
|
12 |
|
13 |
See example of inference pipeline for Russian TTS (G2P + FastPitch + HifiGAN) in this [notebook](https://github.com/bene-ges/nemo_compatible/blob/main/notebooks/Russian_TTS_with_IPA_G2P_FastPitch_and_HifiGAN.ipynb).
|
|
|
14 |
|
15 |
### Input
|
16 |
|
17 |
-
This model
|
|
|
18 |
If you feed plain text directly, it will work, but quality will be low.
|
19 |
|
20 |
### Output
|
@@ -24,6 +26,7 @@ This model generates mel spectrograms.
|
|
24 |
## Training
|
25 |
|
26 |
The NeMo toolkit [1] was used for training the model for 1000+ epochs.
|
|
|
27 |
|
28 |
### Datasets
|
29 |
|
|
|
11 |
### How to use
|
12 |
|
13 |
See example of inference pipeline for Russian TTS (G2P + FastPitch + HifiGAN) in this [notebook](https://github.com/bene-ges/nemo_compatible/blob/main/notebooks/Russian_TTS_with_IPA_G2P_FastPitch_and_HifiGAN.ipynb).
|
14 |
+
Or use this [bash-script](https://github.com/bene-ges/nemo_compatible/blob/main/scripts/tts/ru_ipa_fastpitch_hifigan/test.sh)
|
15 |
|
16 |
### Input
|
17 |
|
18 |
+
This model is indended to be used in a G2P + FastPitch + HifiGAN pipeline (see above).
|
19 |
+
If run independently, it expects text converted to IPA-like transcriptions. See this [g2p model](https://huggingface.co/bene-ges/ru_g2p_ipa_bert_large) for conversion of plain Russian text to phonemes.
|
20 |
If you feed plain text directly, it will work, but quality will be low.
|
21 |
|
22 |
### Output
|
|
|
26 |
## Training
|
27 |
|
28 |
The NeMo toolkit [1] was used for training the model for 1000+ epochs.
|
29 |
+
Full training script is [here](https://github.com/bene-ges/nemo_compatible/blob/main/scripts/tts/ru_ipa_fastpitch_hifigan/train.sh)
|
30 |
|
31 |
### Datasets
|
32 |
|