w11wo commited on
Commit
4e186aa
·
1 Parent(s): fdb9167

Updated README

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -9,9 +9,9 @@ tags:
9
  inference: false
10
  datasets:
11
  - bookbot/sw-TZ-Victoria
12
- - bookbot/sw-TZ-Victoria-syllables
13
  - bookbot/sw-TZ-Victoria-v2
14
- - bookbot/sw-TZ-VictoriaNeural
15
  ---
16
 
17
  # MB-MelGAN HiFi PostNets SW v4
@@ -19,9 +19,9 @@ datasets:
19
  MB-MelGAN HiFi PostNets SW v4 is a mel-to-wav model based on the [MB-MelGAN](https://arxiv.org/abs/2005.05106) architecture with [HiFi-GAN](https://arxiv.org/abs/2010.05646) discriminator. This model was trained from scratch on trained on real and synthetic audio datasets. Instead of training on ground truth waveform spectrograms, this model was trained on the generated PostNet spectrograms of [LightSpeech MFA SW v4](https://huggingface.co/bookbot/lightspeech-mfa-sw-v4). The list of speakers include:
20
 
21
  - sw-TZ-Victoria
22
- - sw-TZ-Victoria-syllables
23
  - sw-TZ-Victoria-v2
24
- - sw-TZ-VictoriaNeural
25
 
26
  This model was trained using the [TensorFlowTTS](https://github.com/TensorSpeech/TensorFlowTTS) framework. All training was done on a RTX 4090 GPU. All necessary scripts used for training could be found in this [Github Fork](https://github.com/bookbot-hive/TensorFlowTTS), as well as the [Training metrics](https://huggingface.co/bookbot/mb-melgan-hifi-postnets-sw-v4/tensorboard) logged via Tensorboard.
27
 
 
9
  inference: false
10
  datasets:
11
  - bookbot/sw-TZ-Victoria
12
+ - bookbot/sw-TZ-Victoria-syllables-word
13
  - bookbot/sw-TZ-Victoria-v2
14
+ - bookbot/sw-TZ-VictoriaNeural-upsampled-48kHz
15
  ---
16
 
17
  # MB-MelGAN HiFi PostNets SW v4
 
19
  MB-MelGAN HiFi PostNets SW v4 is a mel-to-wav model based on the [MB-MelGAN](https://arxiv.org/abs/2005.05106) architecture with [HiFi-GAN](https://arxiv.org/abs/2010.05646) discriminator. This model was trained from scratch on trained on real and synthetic audio datasets. Instead of training on ground truth waveform spectrograms, this model was trained on the generated PostNet spectrograms of [LightSpeech MFA SW v4](https://huggingface.co/bookbot/lightspeech-mfa-sw-v4). The list of speakers include:
20
 
21
  - sw-TZ-Victoria
22
+ - sw-TZ-Victoria-syllables-word
23
  - sw-TZ-Victoria-v2
24
+ - sw-TZ-VictoriaNeural-upsampled-48kHz
25
 
26
  This model was trained using the [TensorFlowTTS](https://github.com/TensorSpeech/TensorFlowTTS) framework. All training was done on a RTX 4090 GPU. All necessary scripts used for training could be found in this [Github Fork](https://github.com/bookbot-hive/TensorFlowTTS), as well as the [Training metrics](https://huggingface.co/bookbot/mb-melgan-hifi-postnets-sw-v4/tensorboard) logged via Tensorboard.
27