nguyenvulebinh/wav2vec2-base-vietnamese-250h

Automatic Speech Recognition


nguyenvulebinh committed on Sep 3, 2021

Commit fc34fdb (1 parent: f121903)

Update README.md

Files changed (1)

README.md +29 -4

@@ -14,6 +14,31 @@ widget:
   src: https://huggingface.co/nguyenvulebinh/wav2vec2-base-vietnamese-250h/raw/main/audio-test/t1_utt000000042.wav
 - label: VLSP ASR 2020 test T2
   src: https://huggingface.co/nguyenvulebinh/wav2vec2-base-vietnamese-250h/raw/main/audio-test/t2_0000006682.wav
+model-index:
+- name: Vietnamese end-to-end speech recognition using wav2vec 2.0 by VietAI
+  results:
+  - task:
+      name: Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: Common Voice vi
+      type: common_voice
+      args: vi
+    metrics:
+       - name: Test WER
+         type: wer
+         value: 11.52
+  - task:
+      name: Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: VIVOS
+      type: vivos
+      args: vi
+    metrics:
+       - name: Test WER
+         type: wer
+         value: 6.15
 ---
 # Wav2Vec2-Base-250h for the Vietnamese language
@@ -40,10 +65,10 @@ Detail of training and fine-tuning process, the audience can follow [fairseq git
 ### Benchmark WER result:
-| | [VIVOS](https://ailab.hcmus.edu.vn/vivos) | [VLSP-T1](https://vlsp.org.vn/vlsp2020/eval/asr) | [VLSP-T2](https://vlsp.org.vn/vlsp2020/eval/asr) |
-|---|---|---|---|
-|without LM| 10.77 | 13.33 | 51.45 |
-|with 4-grams LM| 6.15 | 9.11 | 40.81 |
+| | [VIVOS](https://ailab.hcmus.edu.vn/vivos) | [COMMON VOICE VI](https://paperswithcode.com/dataset/common-voice) | [VLSP-T1](https://vlsp.org.vn/vlsp2020/eval/asr) | [VLSP-T2](https://vlsp.org.vn/vlsp2020/eval/asr) |
+|---|---|---|---|---|
+|without LM| 10.77 | 18.34 | 13.33 | 51.45 |
+|with 4-grams LM| 6.15 | 11.52 | 9.11 | 40.81 |
 ### Example usage
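
Review note on the benchmark figures in this diff: the two `Test WER` values added under `model-index` (11.52 for Common Voice vi, 6.15 for VIVOS) appear to correspond to the "with 4-grams LM" row of the table rather than the "without LM" row. The sketch below is not part of the commit; it only re-derives that correspondence and the relative WER reduction from the numbers shown above. The names `WER`, `MODEL_INDEX`, `relative_reduction`, and `lm_gains` are hypothetical helpers introduced for illustration.

```python
# WER figures (percent) copied from the benchmark table in this diff.
WER = {
    "VIVOS": {"without_lm": 10.77, "with_lm": 6.15},
    "Common Voice vi": {"without_lm": 18.34, "with_lm": 11.52},
    "VLSP-T1": {"without_lm": 13.33, "with_lm": 9.11},
    "VLSP-T2": {"without_lm": 51.45, "with_lm": 40.81},
}

# Test WER values copied from the model-index block added in this diff.
MODEL_INDEX = {"Common Voice vi": 11.52, "VIVOS": 6.15}


def relative_reduction(before: float, after: float) -> float:
    """Relative WER reduction in percent when going from `before` to `after`."""
    return 100.0 * (before - after) / before


def lm_gains(table: dict) -> dict:
    """Relative WER reduction from the 4-gram LM for each test set, rounded to 0.1."""
    return {
        name: round(relative_reduction(row["without_lm"], row["with_lm"]), 1)
        for name, row in table.items()
    }


def metadata_matches_lm_row(table: dict, metadata: dict) -> bool:
    """True if every model-index value equals the 'with LM' figure for that test set."""
    return all(table[name]["with_lm"] == value for name, value in metadata.items())


if __name__ == "__main__":
    for name, gain in lm_gains(WER).items():
        print(f"{name}: {gain}% relative WER reduction with the 4-gram LM")
    print("model-index matches the with-LM row:", metadata_matches_lm_row(WER, MODEL_INDEX))
```

Readers comparing against acoustic-model-only results from other model cards may want to use the "without LM" row instead, since the metadata values assume LM decoding.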