Updated README
Browse files
README.md
CHANGED
@@ -9,9 +9,9 @@ tags:
|
|
9 |
inference: false
|
10 |
datasets:
|
11 |
- bookbot/sw-TZ-Victoria
|
12 |
-
- bookbot/sw-TZ-Victoria-syllables
|
13 |
- bookbot/sw-TZ-Victoria-v2
|
14 |
-
- bookbot/sw-TZ-VictoriaNeural
|
15 |
---
|
16 |
|
17 |
# MB-MelGAN HiFi PostNets SW v4
|
@@ -19,9 +19,9 @@ datasets:
|
|
19 |
MB-MelGAN HiFi PostNets SW v4 is a mel-to-wav model based on the [MB-MelGAN](https://arxiv.org/abs/2005.05106) architecture with [HiFi-GAN](https://arxiv.org/abs/2010.05646) discriminator. This model was trained from scratch on trained on real and synthetic audio datasets. Instead of training on ground truth waveform spectrograms, this model was trained on the generated PostNet spectrograms of [LightSpeech MFA SW v4](https://huggingface.co/bookbot/lightspeech-mfa-sw-v4). The list of speakers include:
|
20 |
|
21 |
- sw-TZ-Victoria
|
22 |
-
- sw-TZ-Victoria-syllables
|
23 |
- sw-TZ-Victoria-v2
|
24 |
-
- sw-TZ-VictoriaNeural
|
25 |
|
26 |
This model was trained using the [TensorFlowTTS](https://github.com/TensorSpeech/TensorFlowTTS) framework. All training was done on a RTX 4090 GPU. All necessary scripts used for training could be found in this [Github Fork](https://github.com/bookbot-hive/TensorFlowTTS), as well as the [Training metrics](https://huggingface.co/bookbot/mb-melgan-hifi-postnets-sw-v4/tensorboard) logged via Tensorboard.
|
27 |
|
|
|
9 |
inference: false
|
10 |
datasets:
|
11 |
- bookbot/sw-TZ-Victoria
|
12 |
+
- bookbot/sw-TZ-Victoria-syllables-word
|
13 |
- bookbot/sw-TZ-Victoria-v2
|
14 |
+
- bookbot/sw-TZ-VictoriaNeural-upsampled-48kHz
|
15 |
---
|
16 |
|
17 |
# MB-MelGAN HiFi PostNets SW v4
|
|
|
19 |
MB-MelGAN HiFi PostNets SW v4 is a mel-to-wav model based on the [MB-MelGAN](https://arxiv.org/abs/2005.05106) architecture with [HiFi-GAN](https://arxiv.org/abs/2010.05646) discriminator. This model was trained from scratch on trained on real and synthetic audio datasets. Instead of training on ground truth waveform spectrograms, this model was trained on the generated PostNet spectrograms of [LightSpeech MFA SW v4](https://huggingface.co/bookbot/lightspeech-mfa-sw-v4). The list of speakers include:
|
20 |
|
21 |
- sw-TZ-Victoria
|
22 |
+
- sw-TZ-Victoria-syllables-word
|
23 |
- sw-TZ-Victoria-v2
|
24 |
+
- sw-TZ-VictoriaNeural-upsampled-48kHz
|
25 |
|
26 |
This model was trained using the [TensorFlowTTS](https://github.com/TensorSpeech/TensorFlowTTS) framework. All training was done on a RTX 4090 GPU. All necessary scripts used for training could be found in this [Github Fork](https://github.com/bookbot-hive/TensorFlowTTS), as well as the [Training metrics](https://huggingface.co/bookbot/mb-melgan-hifi-postnets-sw-v4/tensorboard) logged via Tensorboard.
|
27 |
|