KBLab
/

wav2vec2-large-voxpopuli-sv-swedish

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

marma commited on May 7, 2021

Commit

16a4caa

•

1 Parent(s): 2faa8a4

Update README.md

Files changed (1) hide show

README.md +14 -11

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ datasets:
 - NST Swedish ASR Database
 metrics:
 - wer
-#- cer
 tags:
 - audio
 - automatic-speech-recognition
@@ -15,15 +15,15 @@ license: cc-by-nc-4.0
 model-index:
 - name: Wav2vec 2.0 large VoxPopuli-sv swedish
   results:
-  - task:
-      name: Speech Recognition
-      type: automatic-speech-recognition
-    dataset:
-      name: NST Swedish ASR Database
-    metrics:
-       - name: Test WER
-         type: wer
-         value: 5.192353080009441
   - task:
       name: Speech Recognition
       type: automatic-speech-recognition
@@ -34,7 +34,10 @@ model-index:
     metrics:
        - name: Test WER
          type: wer
-         value: 17.37743757973392
 ---
 # Wav2vec 2.0 large-voxpopuli-sv-swedish
 Finetuned version of Facebooks [VoxPopuli-sv large](https://huggingface.co/facebook/wav2vec2-large-sv-voxpopuli) model using NST and Common Voice data. Evalutation without a language model gives the following: WER for NST + Common Voice test set (2% of total sentences) is **5.19%**, WER for Common Voice test set is **17.38%**.

 - NST Swedish ASR Database
 metrics:
 - wer
+- cer
 tags:
 - audio
 - automatic-speech-recognition
 model-index:
 - name: Wav2vec 2.0 large VoxPopuli-sv swedish
   results:
+#  - task:
+#      name: Speech Recognition
+#      type: automatic-speech-recognition
+#    dataset:
+#      name: NST Swedish ASR Database
+#    metrics:
+#       - name: Test WER
+#         type: wer
+#         value: 5.192353080009441
   - task:
       name: Speech Recognition
       type: automatic-speech-recognition
     metrics:
        - name: Test WER
          type: wer
+         value: 14.343744
+       - name: Test CER
+         type: cer
+         value: 4.936313
 ---
 # Wav2vec 2.0 large-voxpopuli-sv-swedish
 Finetuned version of Facebooks [VoxPopuli-sv large](https://huggingface.co/facebook/wav2vec2-large-sv-voxpopuli) model using NST and Common Voice data. Evalutation without a language model gives the following: WER for NST + Common Voice test set (2% of total sentences) is **5.19%**, WER for Common Voice test set is **17.38%**.