Marco-Cheung/whisper-small-cantonese

Automatic Speech Recognition

Generated from Trainer


Marco-Cheung committed on Aug 9, 2023

Commit ac8befd · 1 Parent(s): 165e96d

End of training

Files changed (2)

README.md +11 -11
generation_config.json +2 -2

README.md CHANGED

@@ -24,7 +24,7 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 114.19385194479297
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -34,9 +34,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 13 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2812
-- Wer Ortho: 116.9664
-- Wer: 114.1939
 ## Model description
@@ -57,19 +57,19 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
 - train_batch_size: 16
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
-- lr_scheduler_warmup_steps: 50
-- training_steps: 1000
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Wer Ortho | Wer      |
-|:-------------:|:-----:|:----:|:---------------:|:---------:|:--------:|
-| 0.3111        | 0.57  | 500  | 0.3072          | 92.3269   | 105.6462 |
-| 0.1729        | 1.14  | 1000 | 0.2812          | 116.9664  | 114.1939 |
 ### Framework versions

     metrics:
     - name: Wer
       type: wer
+      value: 57.700752823086574
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 13 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2487
+- Wer Ortho: 57.8423
+- Wer: 57.7008
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
 - train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
+- lr_scheduler_warmup_steps: 10
+- training_steps: 2000
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer Ortho | Wer     |
+|:-------------:|:-----:|:----:|:---------------:|:---------:|:-------:|
+| 0.1621        | 1.14  | 1000 | 0.2587          | 61.0824   | 65.0094 |
+| 0.0767        | 2.28  | 2000 | 0.2487          | 57.8423   | 57.7008 |
 ### Framework versions
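
The pre-commit WER of 114.19 exceeds 100, which is possible because word error rate counts insertions as well as substitutions and deletions, so a hypothesis much longer than its reference can score above 100%. The following is a minimal sketch of that arithmetic, assuming whitespace tokenization and corpus-level aggregation. The function names `edit_distance` and `wer_percent` are hypothetical, and this is not necessarily how the metric reported in the model card was computed.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference token
                curr[j - 1] + 1,     # insertion of a hypothesis token
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer_percent(references, hypotheses):
    """Corpus-level WER in percent: total edits / total reference tokens."""
    errors = 0
    ref_tokens = 0
    for ref, hyp in zip(references, hypotheses):
        r, h = ref.split(), hyp.split()
        errors += edit_distance(r, h)
        ref_tokens += len(r)
    return 100.0 * errors / ref_tokens


# Two reference tokens, four wrong hypothesis tokens:
# 2 substitutions + 2 insertions = 4 edits over 2 reference tokens.
print(wer_percent(["a b"], ["x y z w"]))  # 200.0
```

Under this definition, a model that decodes into the wrong language or script can produce scores above 100 even when the validation loss looks reasonable.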

generation_config.json CHANGED

@@ -51,7 +51,7 @@
   "forced_decoder_ids": [
     [
       1,
-      50322
     ],
     [
       2,
@@ -164,7 +164,7 @@
     "<|yo|>": 50325,
     "<|zh|>": 50260
   },
-  "language": "sinhalese",
   "max_initial_timestamp_index": 1,
   "max_length": 448,
   "no_timestamps_token_id": 50363,

   "forced_decoder_ids": [
     [
       1,
+      50260
     ],
     [
       2,
     "<|yo|>": 50325,
     "<|zh|>": 50260
   },
+  "language": "chinese",
   "max_initial_timestamp_index": 1,
   "max_length": 448,
   "no_timestamps_token_id": 50363,
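
The two hunks above change the same setting in two places: the token forced at decoder position 1 goes from 50322 to 50260 (the ID listed for `<|zh|>`), and the `language` field goes from `sinhalese` to `chinese`. The sketch below shows how the two fields can be kept consistent, using plain dictionaries rather than any library API. The helper `set_language` is hypothetical, the key name `lang_to_id` is an assumption (the diff shows only the end of that mapping), and the remaining `forced_decoder_ids` entries are left out because the diff does not show them.

```python
def set_language(generation_config, lang_token, language_name):
    """Return a copy of the config with the language token forced at position 1."""
    lang_id = generation_config["lang_to_id"][lang_token]
    updated = dict(generation_config)
    updated["forced_decoder_ids"] = [
        [pos, lang_id] if pos == 1 else [pos, tok]
        for pos, tok in generation_config["forced_decoder_ids"]
    ]
    updated["language"] = language_name
    return updated


# Values taken from the diff; everything not shown there is omitted.
old_config = {
    "forced_decoder_ids": [[1, 50322]],
    "lang_to_id": {"<|yo|>": 50325, "<|zh|>": 50260},
    "language": "sinhalese",
}

new_config = set_language(old_config, "<|zh|>", "chinese")
print(new_config["forced_decoder_ids"])  # [[1, 50260]]
print(new_config["language"])            # chinese
```

Deriving the forced token from the mapping, instead of hard-coding the ID, avoids the mismatch the previous revision had between the intended language and the token actually forced.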