nvidia
/

stt_ua_fastconformer_hybrid_large_pc

Automatic Speech Recognition

hf-asr-leaderboard

Model card Files Files and versions Community

Update README.md

#1

by nkarpov - opened Jun 12, 2023

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -44,14 +44,14 @@ model-index:
       config: Ukrainian P&C
       split: test
       args:
-        language: uk
     metrics:
     - name: Test WER P&C
       type: wer
       value: 6.92
 ---
-# NVIDIA FastConformer-Hybrid Large (uk)
 <style>
 img {
@@ -83,7 +83,7 @@ The model is available for use in the NeMo toolkit [3], and can be used as a pre
 ```python
 import nemo.collections.asr as nemo_asr
-asr_model = nemo_asr.models.EncDecHybridRNNTCTCBPEModel.from_pretrained(model_name="nvidia/stt_uk_fastconformer_hybrid_large_pc")
 ```
 ### Transcribing using Python
@@ -101,14 +101,14 @@ asr_model.transcribe(['2086-149220-0033.wav'])
 Using Transducer mode inference:
 ```shell
 python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py
- pretrained_name="nvidia/stt_uk_fastconformer_hybrid_large_pc"
  audio_dir="<DIRECTORY CONTAINING AUDIO FILES>"
 ```
 Using CTC mode inference:
 ```shell
 python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py
- pretrained_name="nvidia/stt_uk_fastconformer_hybrid_large_pc"
  audio_dir="<DIRECTORY CONTAINING AUDIO FILES>"
  decoder_type="ctc"
 ```

       config: Ukrainian P&C
       split: test
       args:
+        language: ua
     metrics:
     - name: Test WER P&C
       type: wer
       value: 6.92
 ---
+# NVIDIA FastConformer-Hybrid Large (ua)
 <style>
 img {
 ```python
 import nemo.collections.asr as nemo_asr
+asr_model = nemo_asr.models.EncDecHybridRNNTCTCBPEModel.from_pretrained(model_name="nvidia/stt_ua_fastconformer_hybrid_large_pc")
 ```
 ### Transcribing using Python
 Using Transducer mode inference:
 ```shell
 python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py
+ pretrained_name="nvidia/stt_ua_fastconformer_hybrid_large_pc"
  audio_dir="<DIRECTORY CONTAINING AUDIO FILES>"
 ```
 Using CTC mode inference:
 ```shell
 python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py
+ pretrained_name="nvidia/stt_ua_fastconformer_hybrid_large_pc"
  audio_dir="<DIRECTORY CONTAINING AUDIO FILES>"
  decoder_type="ctc"
 ```