TalTechNLP
/

voxlingua107-epaca-tdnn

Audio Classification

Model card Files Files and versions Community

TanelAlumae commited on 29 days ago

Commit

1b1adee

•

1 Parent(s): dfa22af

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -195,7 +195,7 @@ Since the model is trained on VoxLingua107, it has many limitations and biases,
 ## Training data
-The model is trained on [VoxLingua107](http://bark.phon.ioc.ee/voxlingua107/).
 VoxLingua107 is a speech dataset for training spoken language identification models.
 The dataset consists of short speech segments automatically extracted from YouTube videos and labeled according the language of the video title and description, with some post-processing steps to filter out false positives.

 ## Training data
+The model is trained on [VoxLingua107](https://cs.taltech.ee/staff/tanel.alumae/data/voxlingua107/).
 VoxLingua107 is a speech dataset for training spoken language identification models.
 The dataset consists of short speech segments automatically extracted from YouTube videos and labeled according the language of the video title and description, with some post-processing steps to filter out false positives.