metadata
base_model: facebook/wav2vec2-xls-r-300m
language:
- uk
license: apache-2.0
tags:
- automatic-speech-recognition
datasets:
- mozilla-foundation/common_voice_10_0
metrics:
- wer
model-index:
- name: w2v-xls-r-uk
results:
- task:
name: Automatic Speech Recognition
type: automatic-speech-recognition
dataset:
name: common_voice_10_0
type: common_voice_10_0
config: uk
split: test
args: uk
metrics:
- name: Wer
type: wer
value: 0.0463
π¨π¨π¨ ATTENTION! π¨π¨π¨
Use an updated model: https://huggingface.co/Yehor/w2v-bert-uk-v2.1
Community
- Discord: https://discord.gg/yVAjkBgmt4
- Speech Recognition: https://t.me/speech_recognition_uk
- Speech Synthesis: https://t.me/speech_synthesis_uk
Overview
This model has apostrophes and hyphens.
The language model is trained on the texts of the Common Voice dataset, which is used during training.
Metrics:
Dataset | CER | WER |
---|---|---|
CV7 (no LM) | 0.0432 | 0.2288 |
CV7 (with LM) | 0.0169 | 0.0706 |
CV10 (no LM) | 0.0412 | 0.2206 |
CV10 (with LM) | 0.0118 | 0.0463 |