bookbot
/

mb-melgan-hifi-postnets-sw-v4

Model card Files Files and versions Metrics Training metrics Community

w11wo commited on Aug 5, 2024

Commit

4e186aa

·

1 Parent(s): fdb9167

Updated README

Files changed (1) hide show

README.md +4 -4

README.md CHANGED Viewed

@@ -9,9 +9,9 @@ tags:
 inference: false
 datasets:
   - bookbot/sw-TZ-Victoria
-  - bookbot/sw-TZ-Victoria-syllables
   - bookbot/sw-TZ-Victoria-v2
-  - bookbot/sw-TZ-VictoriaNeural
 ---
 # MB-MelGAN HiFi PostNets SW v4
@@ -19,9 +19,9 @@ datasets:
 MB-MelGAN HiFi PostNets SW v4 is a mel-to-wav model based on the [MB-MelGAN](https://arxiv.org/abs/2005.05106) architecture with [HiFi-GAN](https://arxiv.org/abs/2010.05646) discriminator. This model was trained from scratch on trained on real and synthetic audio datasets. Instead of training on ground truth waveform spectrograms, this model was trained on the generated PostNet spectrograms of [LightSpeech MFA SW v4](https://huggingface.co/bookbot/lightspeech-mfa-sw-v4). The list of speakers include:
 - sw-TZ-Victoria
-- sw-TZ-Victoria-syllables
 - sw-TZ-Victoria-v2
-- sw-TZ-VictoriaNeural
 This model was trained using the [TensorFlowTTS](https://github.com/TensorSpeech/TensorFlowTTS) framework. All training was done on a RTX 4090 GPU. All necessary scripts used for training could be found in this [Github Fork](https://github.com/bookbot-hive/TensorFlowTTS), as well as the [Training metrics](https://huggingface.co/bookbot/mb-melgan-hifi-postnets-sw-v4/tensorboard) logged via Tensorboard.

 inference: false
 datasets:
   - bookbot/sw-TZ-Victoria
+  - bookbot/sw-TZ-Victoria-syllables-word
   - bookbot/sw-TZ-Victoria-v2
+  - bookbot/sw-TZ-VictoriaNeural-upsampled-48kHz
 ---
 # MB-MelGAN HiFi PostNets SW v4
 MB-MelGAN HiFi PostNets SW v4 is a mel-to-wav model based on the [MB-MelGAN](https://arxiv.org/abs/2005.05106) architecture with [HiFi-GAN](https://arxiv.org/abs/2010.05646) discriminator. This model was trained from scratch on trained on real and synthetic audio datasets. Instead of training on ground truth waveform spectrograms, this model was trained on the generated PostNet spectrograms of [LightSpeech MFA SW v4](https://huggingface.co/bookbot/lightspeech-mfa-sw-v4). The list of speakers include:
 - sw-TZ-Victoria
+- sw-TZ-Victoria-syllables-word
 - sw-TZ-Victoria-v2
+- sw-TZ-VictoriaNeural-upsampled-48kHz
 This model was trained using the [TensorFlowTTS](https://github.com/TensorSpeech/TensorFlowTTS) framework. All training was done on a RTX 4090 GPU. All necessary scripts used for training could be found in this [Github Fork](https://github.com/bookbot-hive/TensorFlowTTS), as well as the [Training metrics](https://huggingface.co/bookbot/mb-melgan-hifi-postnets-sw-v4/tensorboard) logged via Tensorboard.