juanjucm commited on
Commit
a2c5972
·
verified ·
1 Parent(s): 4236973

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -7
README.md CHANGED
@@ -16,19 +16,23 @@ language:
16
  - gl
17
  ---
18
 
19
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
20
- should probably proofread and complete it, then remove this comment. -->
21
-
22
  # whisper-large-v3-turbo-OpenSLR-GL
23
 
24
- This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) on [juanjucm/OpenSLR-SpeechT-GL-EN](https://huggingface.co/datasets/juanjucm/OpenSLR-SpeechT-GL-EN).
25
- It achieves the following results on the evaluation set:
 
 
 
 
 
 
 
 
26
  - Loss: 0.1613
27
  - Wer: 10.6845
28
 
29
- ### Training hyperparameters
30
-
31
  The following hyperparameters were used during training:
 
32
  - learning_rate: 3e-06
33
  - train_batch_size: 16
34
  - eval_batch_size: 8
 
16
  - gl
17
  ---
18
 
 
 
 
19
  # whisper-large-v3-turbo-OpenSLR-GL
20
 
21
+ This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) trained on [juanjucm/OpenSLR-SpeechT-GL-EN](https://huggingface.co/datasets/juanjucm/OpenSLR-SpeechT-GL-EN). for **Galician Text to Speech** task. It takes galician speech audios as input and generates the correspondant transcription.
22
+
23
+ This Automatic Speech Recognition model, was developed to be the first stage of a Speech Translation cascade system for transcribing and translating Galician audios into English texts. After this first STT step, this [Galician-to-English MT model](https://huggingface.co/juanjucm/nllb-200-distilled-600M-OpenSLR-GL-EN) can applied over the generated Galician transcriptions to get English text translations.
24
+ The motivation behind this work is to increase the visibility of the Galician language, making it more accessible for non-Galician speakers to understand and engage with Galician audio content.
25
+
26
+ This model was developed during a 3-week Speech Translation workshop organised by [Yasmin Moslem](https://huggingface.co/ymoslem).
27
+
28
+ ### Performance and training details
29
+
30
+ This model was developed during a 3-week Speech Translation workshop organised by [Yasmin Moslem](https://huggingface.co/ymoslem).
31
  - Loss: 0.1613
32
  - Wer: 10.6845
33
 
 
 
34
  The following hyperparameters were used during training:
35
+
36
  - learning_rate: 3e-06
37
  - train_batch_size: 16
38
  - eval_batch_size: 8