Update README.md
Browse files
README.md
CHANGED
@@ -144,18 +144,18 @@ This repository contains:
|
|
144 |
**Languages present not indexed by Huggingface:** Asturian (ast), Basaa (bas), Cebuano (ceb), Central Kurdish/Sorani (ckb), Hakha Chin (cnh), Hawaiian (haw), Upper Sorbian (hsb) Kabyle (kab), Moksha (mdf), Meadow Mari (mhr), Hill Mari (mrj), Erzya (myv), Taiwanese Hokkien (nan-tw), Sursilvan (rm-sursilv), Vallader (rm-vallader), Sakha (sah), Santali (sat), Scots (sco), Saraiki (skr), Tigre (tig), Tok Pisin (tpi), Akwapen Twi (tw-akuapem), Asante Twi (tw-asante), Votic (vot), Waray (war), Cantonese (yue).
|
145 |
|
146 |
**Datasets:**
|
147 |
-
* Aishell
|
148 |
-
* BibleTTS
|
149 |
-
* ClovaCall
|
150 |
-
* CommonVoice v11
|
151 |
-
* Google TTS data
|
152 |
-
* IISc-MILE
|
153 |
-
*
|
154 |
-
* Kokoro
|
155 |
-
* Kosp2e
|
156 |
-
* Media Speech
|
157 |
-
* Multilingual LibriSpeech
|
158 |
-
* Samrómur
|
159 |
-
* THCHS-30 and THUYG-20
|
160 |
-
* VoxLingua107
|
161 |
-
* VoxPopuli
|
|
|
144 |
**Languages present not indexed by Huggingface:** Asturian (ast), Basaa (bas), Cebuano (ceb), Central Kurdish/Sorani (ckb), Hakha Chin (cnh), Hawaiian (haw), Upper Sorbian (hsb) Kabyle (kab), Moksha (mdf), Meadow Mari (mhr), Hill Mari (mrj), Erzya (myv), Taiwanese Hokkien (nan-tw), Sursilvan (rm-sursilv), Vallader (rm-vallader), Sakha (sah), Santali (sat), Scots (sco), Saraiki (skr), Tigre (tig), Tok Pisin (tpi), Akwapen Twi (tw-akuapem), Asante Twi (tw-asante), Votic (vot), Waray (war), Cantonese (yue).
|
145 |
|
146 |
**Datasets:**
|
147 |
+
* [Aishell](https://www.openslr.org/33/) and [AISHELL-3](https://www.openslr.org/93/)
|
148 |
+
* [BibleTTS](https://www.openslr.org/129/)
|
149 |
+
* [ClovaCall](https://github.com/clovaai/ClovaCall)
|
150 |
+
* [CommonVoice v11](https://commonvoice.mozilla.org/en/datasets)
|
151 |
+
* Google TTS data: [Javanese](https://www.openslr.org/41/), [Khmer](https://www.openslr.org/42/), [Nepali](https://www.openslr.org/43/), [Sundanese](https://www.openslr.org/44/), [South African Languages](https://www.openslr.org/32/), [Bengali Languages](https://www.openslr.org/37/)
|
152 |
+
* IISc-MILE: [Tamil](https://www.openslr.org/127/), [Kannada](https://www.openslr.org/126/)
|
153 |
+
* [Japanese Versatile Speech](https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus)
|
154 |
+
* [Kokoro](https://github.com/kaiidams/Kokoro-Speech-Dataset)
|
155 |
+
* [Kosp2e](https://github.com/warnikchow/kosp2e)
|
156 |
+
* Media Speech: [Turkish Only](https://www.openslr.org/108/)
|
157 |
+
* [Multilingual LibriSpeech](https://www.openslr.org/94/)
|
158 |
+
* [Samrómur](https://www.openslr.org/128/)
|
159 |
+
* [THCHS-30](https://www.openslr.org/18/) and [THUYG-20](https://www.openslr.org/22/)
|
160 |
+
* [VoxLingua107](https://bark.phon.ioc.ee/voxlingua107/)
|
161 |
+
* [VoxPopuli](https://github.com/facebookresearch/voxpopuli/)
|