nguyenvulebinh
commited on
Commit
•
fc34fdb
1
Parent(s):
f121903
Update README.md
Browse files
README.md
CHANGED
@@ -14,6 +14,31 @@ widget:
|
|
14 |
src: https://huggingface.co/nguyenvulebinh/wav2vec2-base-vietnamese-250h/raw/main/audio-test/t1_utt000000042.wav
|
15 |
- label: VLSP ASR 2020 test T2
|
16 |
src: https://huggingface.co/nguyenvulebinh/wav2vec2-base-vietnamese-250h/raw/main/audio-test/t2_0000006682.wav
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
17 |
---
|
18 |
|
19 |
# Wav2Vec2-Base-250h for the Vietnamese language
|
@@ -40,10 +65,10 @@ Detail of training and fine-tuning process, the audience can follow [fairseq git
|
|
40 |
|
41 |
### Benchmark WER result:
|
42 |
|
43 |
-
| | [VIVOS](https://ailab.hcmus.edu.vn/vivos) | [VLSP-T1](https://vlsp.org.vn/vlsp2020/eval/asr) | [VLSP-T2](https://vlsp.org.vn/vlsp2020/eval/asr) |
|
44 |
-
|
45 |
-
|without LM| 10.77 | 13.33 | 51.45 |
|
46 |
-
|with 4-grams LM| 6.15 | 9.11 | 40.81 |
|
47 |
|
48 |
|
49 |
### Example usage
|
|
|
14 |
src: https://huggingface.co/nguyenvulebinh/wav2vec2-base-vietnamese-250h/raw/main/audio-test/t1_utt000000042.wav
|
15 |
- label: VLSP ASR 2020 test T2
|
16 |
src: https://huggingface.co/nguyenvulebinh/wav2vec2-base-vietnamese-250h/raw/main/audio-test/t2_0000006682.wav
|
17 |
+
model-index:
|
18 |
+
- name: Vietnamese end-to-end speech recognition using wav2vec 2.0 by VietAI
|
19 |
+
results:
|
20 |
+
- task:
|
21 |
+
name: Speech Recognition
|
22 |
+
type: automatic-speech-recognition
|
23 |
+
dataset:
|
24 |
+
name: Common Voice vi
|
25 |
+
type: common_voice
|
26 |
+
args: vi
|
27 |
+
metrics:
|
28 |
+
- name: Test WER
|
29 |
+
type: wer
|
30 |
+
value: 11.52
|
31 |
+
- task:
|
32 |
+
name: Speech Recognition
|
33 |
+
type: automatic-speech-recognition
|
34 |
+
dataset:
|
35 |
+
name: VIVOS
|
36 |
+
type: vivos
|
37 |
+
args: vi
|
38 |
+
metrics:
|
39 |
+
- name: Test WER
|
40 |
+
type: wer
|
41 |
+
value: 6.15
|
42 |
---
|
43 |
|
44 |
# Wav2Vec2-Base-250h for the Vietnamese language
|
|
|
65 |
|
66 |
### Benchmark WER result:
|
67 |
|
68 |
+
| | [VIVOS](https://ailab.hcmus.edu.vn/vivos) | [COMMON VOICE VI](https://paperswithcode.com/dataset/common-voice) | [VLSP-T1](https://vlsp.org.vn/vlsp2020/eval/asr) | [VLSP-T2](https://vlsp.org.vn/vlsp2020/eval/asr) |
|
69 |
+
|---|---|---|---|---|
|
70 |
+
|without LM| 10.77 | 18.34 | 13.33 | 51.45 |
|
71 |
+
|with 4-grams LM| 6.15 | 11.52 | 9.11 | 40.81 |
|
72 |
|
73 |
|
74 |
### Example usage
|