--- base_model: openai/whisper-small datasets: - mozilla-foundation/common_voice_17_0 language: sw library_name: transformers license: apache-2.0 model-index: - name: Finetuned openai/whisper-small on Swahili results: - task: type: automatic-speech-recognition name: Speech-to-Text dataset: name: Common Voice (Swahili) type: common_voice metrics: - type: wer value: 43.876 --- # Finetuned openai/whisper-small on 58000 Swahili training audio samples from mozilla-foundation/common_voice_17_0. This model was created from the Mozilla.ai Blueprint: [speech-to-text-finetune](https://github.com/mozilla-ai/speech-to-text-finetune). ## Evaluation results on 12253 audio samples of Swahili: ### Baseline model (before finetuning) on Swahili - Word Error Rate: 133.795 - Loss: 2.459 ### Finetuned model (after finetuning) on Swahili - Word Error Rate: 43.876 - Loss: 0.653