adrianSauer/wav2vec2-cer

Automatic Speech Recognition

Generated from Trainer


adrianSauer committed on Apr 29, 2024

Commit 71ba1be · verified · 1 Parent(s): 760c568

End of training

Files changed (1)

README.md +15 -15


@@ -8,19 +8,19 @@ tags:
 datasets:
 - mozilla-foundation/common_voice_16_1
 model-index:
-- name: Common Voice 16-10
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Common Voice 16-10
 This model is a fine-tuned version of [glob-asr/wav2vec2-large-xls-r-300m-guarani-small](https://huggingface.co/glob-asr/wav2vec2-large-xls-r-300m-guarani-small) on the Common Voice 16 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3185
-- Cer: 7.1509
 ## Model description
@@ -45,24 +45,24 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
-- lr_scheduler_warmup_steps: 1
 - training_steps: 500
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Cer    |
-|:-------------:|:-----:|:----:|:---------------:|:------:|
-| 0.3978        | 1.01  | 100  | 0.3420          | 8.0058 |
-| 0.3263        | 2.02  | 200  | 0.3318          | 7.7482 |
-| 0.28          | 3.03  | 300  | 0.3226          | 7.4710 |
-| 0.2505        | 4.04  | 400  | 0.3173          | 7.2758 |
-| 0.2427        | 5.05  | 500  | 0.3185          | 7.1509 |
 ### Framework versions
-- Transformers 4.38.2
 - Pytorch 2.2.1+cu121
-- Datasets 2.18.0
-- Tokenizers 0.15.2

 datasets:
 - mozilla-foundation/common_voice_16_1
 model-index:
+- name: Common Voice 16
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Common Voice 16
 This model is a fine-tuned version of [glob-asr/wav2vec2-large-xls-r-300m-guarani-small](https://huggingface.co/glob-asr/wav2vec2-large-xls-r-300m-guarani-small) on the Common Voice 16 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3202
+- Cer: 7.2954
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
+- lr_scheduler_warmup_steps: 50
 - training_steps: 500
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Cer    |
+|:-------------:|:------:|:----:|:---------------:|:------:|
+| 0.4174        | 1.0101 | 100  | 0.3535          | 8.1385 |
+| 0.3411        | 2.0202 | 200  | 0.3387          | 7.8574 |
+| 0.2905        | 3.0303 | 300  | 0.3278          | 7.6076 |
+| 0.2591        | 4.0404 | 400  | 0.3214          | 7.3734 |
+| 0.251         | 5.0505 | 500  | 0.3202          | 7.2954 |
 ### Framework versions
+- Transformers 4.40.1
 - Pytorch 2.2.1+cu121
+- Datasets 2.19.0
+- Tokenizers 0.19.1
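
The main hyperparameter change in this commit is `lr_scheduler_warmup_steps`, which goes from 1 to 50 under the `constant_with_warmup` scheduler. The sketch below illustrates what that change might mean for the first optimizer steps, assuming the schedule ramps the learning rate linearly from zero and then holds it constant. The function name `warmup_multiplier` is hypothetical, and because the base learning rate is not visible in this diff, the sketch returns only the multiplier applied to it.

```python
def warmup_multiplier(step: int, warmup_steps: int) -> float:
    """Return the factor applied to the base learning rate at `step`.

    Assumption: linear ramp from 0 to 1 over `warmup_steps`, then constant.
    """
    if step < warmup_steps:
        # Guard against division by zero when warmup is disabled.
        return step / max(1, warmup_steps)
    return 1.0


# Compare the old setting (1 warmup step) with the new one (50 warmup steps)
# at a few points in the 500-step run.
for warmup_steps in (1, 50):
    samples = {s: warmup_multiplier(s, warmup_steps) for s in (0, 1, 25, 50, 500)}
    print(f"warmup_steps={warmup_steps}: {samples}")
```

Under this assumption, a 1-step warmup reaches the full learning rate almost immediately, while a 50-step warmup spends the first tenth of the 500 training steps ramping up.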