Yehor
/

w2v-xls-r-uk

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

w2v-xls-r-uk / README.md

Yehor's picture

Add a note about the new model

b7b2571 verified 5 months ago

|

896 Bytes

metadata

language:
  - uk
license: apache-2.0
datasets:
  - mozilla-foundation/common_voice_10_0

🇺🇦 Join Ukrainian Speech Recognition Community - https://t.me/speech_recognition_uk

⭐ See other Ukrainian models - https://github.com/egorsmkv/speech-recognition-uk

ATTENTION!

USE UPDATED MODEL: https://huggingface.co/Yehor/w2v-bert-2.0-uk

This model has apostrophes and hyphens.

The language model is trained on the texts of the Common Voice dataset, which is used during training.

Metrics:

Dataset	CER	WER
CV7 (no LM)	0.0432	0.2288
CV7 (with LM)	0.0169	0.0706
CV10 (no LM)	0.0412	0.2206
CV10 (with LM)	0.0118	0.0463

More:

The same model, but trained on noisy data: https://huggingface.co/Yehor/wav2vec2-xls-r-300m-uk-with-small-lm-noisy
Traced JIT version: https://huggingface.co/Yehor/wav2vec2-xls-r-300m-uk-traced-jit