projecte-aina/tts-ca-coqui-vits-multispeaker

carmentano committed on Dec 13, 2024

Commit 40dddcf · verified · 1 Parent(s): 860ad74

Update README.md

README.md +5 -1

@@ -23,7 +23,11 @@ datasets:
 # Aina Project's Catalan multi-speaker text-to-speech model
 ## Model description
-This model was trained from scratch using the [Coqui TTS](https://github.com/coqui-ai/TTS) toolkit on a combination of 3 datasets: [Festcat](http://festcat.talp.cat/devel.php), [OpenSLR69](http://openslr.org/69/) and [Common Voice v12](https://commonvoice.mozilla.org/ca). For the training, we used 487 hours of recordings from 255 speakers. We trimmed and denoised the data; all of it except Common Voice is available as separate datasets, [festcat_trimmed_denoised](projecte-aina/festcat_trimmed_denoised) and [openslr69_trimmed_denoised](projecte-aina/openslr-slr69-ca-trimmed-denoised).
+This model was trained from scratch using the [Coqui TTS](https://github.com/coqui-ai/TTS) toolkit on a combination of 3 datasets:
+[Festcat](http://festcat.talp.cat/devel.php), [OpenSLR69](http://openslr.org/69/) and [Common Voice v12](https://commonvoice.mozilla.org/ca).
+For the training, we used 487 hours of recordings from 255 speakers.
+We trimmed and denoised the data; all of it except Common Voice is available as separate datasets,
+[festcat_trimmed_denoised](https://huggingface.co/datasets/projecte-aina/festcat_trimmed_denoised) and [openslr69_trimmed_denoised](https://huggingface.co/datasets/projecte-aina/openslr-slr69-ca-trimmed-denoised).
 A live inference demo can be found in our spaces, [here](https://huggingface.co/spaces/projecte-aina/tts-ca-coqui-vits-multispeaker).
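
The link change in this diff (bare repo ids such as `projecte-aina/festcat_trimmed_denoised` replaced with absolute dataset URLs) can be sketched as a small Python helper. This is an illustrative sketch only, not something the commit itself used: the function name `absolutize_dataset_links` and the regular expression are hypothetical, and the sketch assumes that every bare `owner/name` link target refers to a dataset repository.

```python
import re

# Assumption: dataset repos live under this prefix, matching the updated links in the diff.
HF_DATASETS_PREFIX = "https://huggingface.co/datasets/"

# Matches a Markdown link whose target is a bare "owner/name" repo id
# (no scheme, exactly one slash), e.g. [text](projecte-aina/festcat_trimmed_denoised).
_BARE_REPO_LINK = re.compile(r"\[([^\]]+)\]\(([A-Za-z0-9_.-]+/[A-Za-z0-9_.-]+)\)")


def absolutize_dataset_links(markdown: str) -> str:
    """Rewrite bare repo-id link targets into absolute dataset URLs (hypothetical helper)."""
    return _BARE_REPO_LINK.sub(
        lambda m: f"[{m.group(1)}]({HF_DATASETS_PREFIX}{m.group(2)})",
        markdown,
    )
```

Links that already carry a scheme (for example the `https://huggingface.co/spaces/...` demo link) are left untouched because `:` is not in the allowed character set. A relative file path with a single slash would also match, so a real cleanup would need to check each target by hand.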