juanjucm commited on
Commit
3203344
·
verified ·
1 Parent(s): d0051d2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -6
README.md CHANGED
@@ -10,6 +10,9 @@ metrics:
10
  model-index:
11
  - name: whisper-small-GL-EN
12
  results: []
 
 
 
13
  ---
14
 
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -17,11 +20,13 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  # whisper-small-GL-EN
19
 
20
- This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on an unknown dataset.
21
- It achieves the following results on the evaluation set:
22
- - Loss: 2.3405
23
- - Wer: 71.9621
24
- - Bleu: 20.1999
 
 
25
 
26
  ## Model description
27
 
@@ -72,4 +77,4 @@ The following hyperparameters were used during training:
72
  - Transformers 4.45.1
73
  - Pytorch 2.4.1+cu121
74
  - Datasets 3.0.1
75
- - Tokenizers 0.20.0
 
10
  model-index:
11
  - name: whisper-small-GL-EN
12
  results: []
13
+ datasets:
14
+ - juanjucm/FLEURS-SpeechT-GL-EN
15
+ - juanjucm/OpenSLR-SpeechT-GL-EN
16
  ---
17
 
18
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
20
 
21
  # whisper-small-GL-EN
22
 
23
+ This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on [juanjucm/FLEURS-SpeechT-GL-EN](https://huggingface.co/datasets/juanjucm/FLEURS-SpeechT-GL-EN).
24
+ The training dataset has been augmented using train split from [juanjucm/OpenSLR-SpeechT-GL-EN](https://huggingface.co/datasets/juanjucm/OpenSLR-SpeechT-GL-EN)
25
+
26
+ It achieves the following results on the evaluation set (evaluated only on [juanjucm/FLEURS-SpeechT-GL-EN](https://huggingface.co/datasets/juanjucm/FLEURS-SpeechT-GL-EN)):
27
+ - Loss: 1.6335
28
+ - Wer: 67.2612
29
+ - Bleu: 22.2158
30
 
31
  ## Model description
32
 
 
77
  - Transformers 4.45.1
78
  - Pytorch 2.4.1+cu121
79
  - Datasets 3.0.1
80
+ - Tokenizers 0.20.0