File size: 1,209 Bytes
a3c0c78
883d070
a3c0c78
 
84403d1
883d070
 
a3c0c78
2df817d
883d070
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
a3c0c78
 
f4b5de0
38b25a9
29ae0cf
38b25a9
 
 
883d070
6d16672
883d070
 
 
f025181
883d070
b7b2571
2519041
6d16672
a218480
bbee575
84403d1
f73f733
84403d1
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
---
base_model: facebook/wav2vec2-xls-r-300m
language: 
  - uk
license: "apache-2.0"
tags:
- automatic-speech-recognition
datasets:
- mozilla-foundation/common_voice_10_0
metrics:
  - wer
model-index:
  - name: w2v-xls-r-uk
    results:
      - task:
          name: Automatic Speech Recognition
          type: automatic-speech-recognition
        dataset:
          name: common_voice_10_0
          type: common_voice_10_0
          config: uk
          split: test
          args: uk
        metrics:
          - name: Wer
            type: wer
            value: 0.0463
---

🚨🚨🚨 **ATTENTION!** 🚨🚨🚨

**Use an updated model**: https://huggingface.co/Yehor/w2v-bert-uk-v2.1

---

## Community

- Discord: https://discord.gg/yVAjkBgmt4
- Speech Recognition: https://t.me/speech_recognition_uk
- Speech Synthesis: https://t.me/speech_synthesis_uk

## Overview

This model has apostrophes and hyphens.

The language model is trained on the texts of the Common Voice dataset, which is used during training.

Metrics:

| Dataset | CER | WER |
|-|-|-|
| CV7 (no LM) |  0.0432 | 0.2288 |
| CV7 (with LM) | 0.0169 | 0.0706 |
| CV10 (no LM) | 0.0412 | 0.2206 |
| CV10 (with LM) | 0.0118 | 0.0463 |