NeonBohdan's picture
Model with 1024 vocab
7ff5ca0
metadata
language:
  - de
library_name: nemo
datasets:
  - mozilla-foundation/common_voice_12_0
tags:
  - automatic-speech-recognition
model-index:
  - name: stt_de_citrinet_512_gamma_0_25
    results:
      - task:
          name: Automatic Speech Recognition
          type: automatic-speech-recognition
        dataset:
          name: Mozilla Common Voice 12.0
          type: mozilla-foundation/common_voice_12_0
          config: clean
          split: test
          args:
            language: de
        metrics:
          - name: Test WER
            type: wer
            value: 11.1
license: bsd-3-clause

NVIDIA Streaming Citrinet 512 (de-DE)

| Model architecture | Model size | Language |

Attribution

As initial checkpoint used stt_en_citrinet_512_gamma_0_25 by NVIDIA licensed under CC-BY-4.0