models/whisper-large-v3-cv17-th-ft-with-lr-1e-3

Browse files

Files changed (3) hide show

README.md +18 -13
adapter_model.safetensors +1 -1
runs/Feb26_14-30-05_gpt7/events.out.tfevents.1740555006.gpt7.338639.0 +2 -2

README.md CHANGED Viewed

@@ -24,7 +24,7 @@ model-index:
       args: 'config: th, split: train'
     metrics:
     - type: wer
-      value: 132.2683744748853
       name: Wer
 ---
@@ -35,9 +35,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the Common Voice 17.0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.1322
-- Cer: 114.9591
-- Wer: 132.2684
 ## Model description
@@ -56,25 +56,30 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.002
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Use paged_adamw_8bit with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- training_steps: 10000
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step  | Validation Loss | Cer      | Wer      |
-|:-------------:|:------:|:-----:|:---------------:|:--------:|:--------:|
-| 3.4988        | 1.0    | 2052  | 3.4636          | 150.2137 | 149.0381 |
-| 3.3477        | 2.0    | 4104  | 3.3319          | 166.3704 | 123.2261 |
-| 3.1868        | 3.0    | 6156  | 3.2054          | 122.7370 | 133.0710 |
-| 3.1487        | 4.0    | 8208  | 3.1660          | 155.4132 | 127.6764 |
-| 3.1219        | 4.8733 | 10000 | 3.1322          | 114.9591 | 132.2684 |
 ### Framework versions

       args: 'config: th, split: train'
     metrics:
     - type: wer
+      value: 77.07049551458215
       name: Wer
 ---
 This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the Common Voice 17.0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1953
+- Cer: 49.5719
+- Wer: 77.0705
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0002
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Use paged_adamw_8bit with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Cer      | Wer      |
+|:-------------:|:-----:|:-----:|:---------------:|:--------:|:--------:|
+| 0.158         | 1.0   | 2052  | 0.1361          | 288.0891 | 56.6543  |
+| 0.1166        | 2.0   | 4104  | 0.1296          | 232.8857 | 57.0405  |
+| 0.0739        | 3.0   | 6156  | 0.1298          | 107.5882 | 42.6799  |
+| 0.0467        | 4.0   | 8208  | 0.1370          | 61.4281  | 43.2440  |
+| 0.0263        | 5.0   | 10260 | 0.1424          | 69.8588  | 80.9373  |
+| 0.012         | 6.0   | 12312 | 0.1607          | 56.1242  | 68.8006  |
+| 0.0081        | 7.0   | 14364 | 0.1649          | 130.1459 | 76.1770  |
+| 0.0036        | 8.0   | 16416 | 0.1782          | 110.6614 | 116.6100 |
+| 0.002         | 9.0   | 18468 | 0.1862          | 50.2801  | 85.8464  |
+| 0.0005        | 10.0  | 20520 | 0.1953          | 49.5719  | 77.0705  |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7b1bd9481d549b569c190cb711f26bbab2dbb20a7bea433bd45011ea0d43c3e3
 size 251714264

 version https://git-lfs.github.com/spec/v1
+oid sha256:73588fc31a3f3b7850f5882ea383672d0ffa936a93e0d0035ec14d3dc00481c5
 size 251714264

runs/Feb26_14-30-05_gpt7/events.out.tfevents.1740555006.gpt7.338639.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dc8a91e09144b47c86dabb66aa37667935f1535c90ae83caef425b8a139ff026
-size 96554

 version https://git-lfs.github.com/spec/v1
+oid sha256:0ef743baed8b2f5d67d2933793c315e03e224c95d76c412e221867f82402b238
+size 97286