Tags: PyTorch, Catalan, TTS, audio, synthesis, VITS, speech, coqui.ai
carmentano committed
Commit 40dddcf
1 Parent(s): 860ad74

Update README.md

Files changed (1): README.md (+5 -1)
@@ -23,7 +23,11 @@ datasets:
  # Aina Project's Catalan multi-speaker text-to-speech model
  ## Model description
 
- This model was trained from scratch using the [Coqui TTS](https://github.com/coqui-ai/TTS) toolkit on a combination of 3 datasets: [Festcat](http://festcat.talp.cat/devel.php), [OpenSLR69](http://openslr.org/69/) and [Common Voice v12](https://commonvoice.mozilla.org/ca). For the training, we used 487 hours of recordings from 255 speakers. We have trimmed and denoised the data which all except Common Voice can be found in a seperate dataset in [festcat_trimmed_denoised](projecte-aina/festcat_trimmed_denoised) and [openslr69_trimmed_denoised](projecte-aina/openslr-slr69-ca-trimmed-denoised).
+ This model was trained from scratch using the [Coqui TTS](https://github.com/coqui-ai/TTS) toolkit on a combination of 3 datasets:
+ [Festcat](http://festcat.talp.cat/devel.php), [OpenSLR69](http://openslr.org/69/) and [Common Voice v12](https://commonvoice.mozilla.org/ca).
+ For the training, we used 487 hours of recordings from 255 speakers.
+ We have trimmed and denoised the data which all except Common Voice can be found in a seperate dataset
+ in [festcat_trimmed_denoised](https://huggingface.co/datasets/projecte-aina/festcat_trimmed_denoised) and [openslr69_trimmed_denoised](https://huggingface.co/datasets/projecte-aina/openslr-slr69-ca-trimmed-denoised).
 
  A live inference demo can be found in our spaces, [here](https://huggingface.co/spaces/projecte-aina/tts-ca-coqui-vits-multispeaker).
 