vumichien committed
Commit 94700a6 · 1 Parent(s): 769ad21

Update README.md

Files changed (1)
  1. README.md +4 -1
README.md CHANGED
@@ -57,7 +57,10 @@ model-index:
  ---
  ## Model description
 
- This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on my collection of Public Japanese Voice datasets for research [Common Voice 7.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0), [JUST](https://sites.google.com/site/shinnosuketakamichi/publication/jsut) (Japanese speech corpus of Saruwatari-lab., University of Tokyo), [JSSS](https://sites.google.com/site/shinnosuketakamichi/research-topics/jsss_corpus) (Japanese speech corpus for summarization and simplification), [CSS10](https://paperswithcode.com/dataset/css10) (A collection of single speaker speech datasets). You can find in preprocessing dataset in here VUMICHIEN/COMMON_VOICE_LARGE_JSUT_JSSS_CSS10. Total training data ~60 hours.
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on my collection of Public Japanese Voice datasets for research [Common Voice 7.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0), [JUST](https://sites.google.com/site/shinnosuketakamichi/publication/jsut) (Japanese speech corpus of Saruwatari-lab., University of Tokyo), [JSSS](https://sites.google.com/site/shinnosuketakamichi/research-topics/jsss_corpus) (Japanese speech corpus for summarization and simplification), [CSS10](https://paperswithcode.com/dataset/css10) (A collection of single speaker speech datasets). You can find in preprocessing dataset in here VUMICHIEN/COMMON_VOICE_LARGE_JSUT_JSSS_CSS10.
+
+ **Total training time:** ~60 hours.
+
  ### Benchmark WER result:
  | | [COMMON VOICE 7.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) | [COMMON VOICE 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0)
  |---|---|---|