juanjucm
/

whisper-large-v3-turbo-OpenHQ-GL

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

juanjucm commited on Jan 13

Commit

a2c5972

·

verified ·

1 Parent(s): 4236973

Update README.md

Files changed (1) hide show

README.md +11 -7

README.md CHANGED Viewed

@@ -16,19 +16,23 @@ language:
 - gl
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # whisper-large-v3-turbo-OpenSLR-GL
-This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) on [juanjucm/OpenSLR-SpeechT-GL-EN](https://huggingface.co/datasets/juanjucm/OpenSLR-SpeechT-GL-EN).
-It achieves the following results on the evaluation set:
 - Loss: 0.1613
 - Wer: 10.6845
-### Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 3e-06
 - train_batch_size: 16
 - eval_batch_size: 8

 - gl
 ---
 # whisper-large-v3-turbo-OpenSLR-GL
+This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) trained on [juanjucm/OpenSLR-SpeechT-GL-EN](https://huggingface.co/datasets/juanjucm/OpenSLR-SpeechT-GL-EN).  for **Galician Text to Speech** task. It takes galician speech audios as input and generates the correspondant transcription.
+This Automatic Speech Recognition model, was developed to be the first stage of a Speech Translation cascade system for transcribing and translating Galician audios into English texts. After this first STT step, this [Galician-to-English MT model](https://huggingface.co/juanjucm/nllb-200-distilled-600M-OpenSLR-GL-EN) can applied over the generated Galician transcriptions to get English text translations.
+The motivation behind this work is to increase the visibility of the Galician language, making it more accessible for non-Galician speakers to understand and engage with Galician audio content.
+This model was developed during a 3-week Speech Translation workshop organised by [Yasmin Moslem](https://huggingface.co/ymoslem).
+### Performance and training details
+This model was developed during a 3-week Speech Translation workshop organised by [Yasmin Moslem](https://huggingface.co/ymoslem).
 - Loss: 0.1613
 - Wer: 10.6845
 The following hyperparameters were used during training:
 - learning_rate: 3e-06
 - train_batch_size: 16
 - eval_batch_size: 8