kiranpantha/whisper-large-v3-nepali

@@ -8,9 +8,24 @@ tags:
 - generated_from_trainer
 datasets:
 - kiranpantha/OpenSLR54-Balanced-Nepali
 model-index:
 - name: Whisper Large v3  Nepali - Kiran Pantha
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -19,6 +34,10 @@ should probably proofread and complete it, then remove this comment. -->
 # Whisper Large v3  Nepali - Kiran Pantha
 This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the OpenSLR54 dataset.
 ## Model description
@@ -38,7 +57,7 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
@@ -47,6 +66,28 @@ The following hyperparameters were used during training:
 - training_steps: 5000
 - mixed_precision_training: Native AMP
 ### Framework versions
 - Transformers 4.46.3

 - generated_from_trainer
 datasets:
 - kiranpantha/OpenSLR54-Balanced-Nepali
+metrics:
+- wer
 model-index:
 - name: Whisper Large v3  Nepali - Kiran Pantha
+  results:
+  - task:
+      name: Automatic Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: OpenSLR54
+      type: kiranpantha/OpenSLR54-Balanced-Nepali
+      config: default
+      split: test
+      args: 'config: ne, split: test'
+    metrics:
+    - name: Wer
+      type: wer
+      value: 20.48611111111111
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # Whisper Large v3  Nepali - Kiran Pantha
 This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the OpenSLR54 dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1507
+- Wer: 20.4861
+- Cer: 4.9839
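For readers checking what the `Wer` and `Cer` values measure: both are edit-distance error rates, counted over words and over characters respectively. The sketch below is a plain-Python illustration, not the evaluation code used for this model. It assumes the card reports corpus-level percentages (total edits divided by total reference units, times 100), which the card itself does not state; the function names and the two sample sentences are made up for the example.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (lists or strings)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference unit
                curr[j - 1] + 1,     # insertion of a hypothesis unit
                prev[j - 1] + cost,  # substitution, or a match when cost is 0
            ))
        prev = curr
    return prev[-1]


def error_rate(references, hypotheses, unit="word"):
    """Corpus-level error rate in percent: total edits / total reference units."""
    if unit == "word":
        split = lambda s: s.split()
    else:
        # Character level here means Unicode code points, spaces included.
        split = lambda s: list(s)
    edits = sum(edit_distance(split(r), split(h))
                for r, h in zip(references, hypotheses))
    total = sum(len(split(r)) for r in references)
    return 100.0 * edits / total


# Hypothetical reference/hypothesis pairs; the second hypothesis has one wrong vowel sign.
refs = ["नमस्ते संसार", "यो एक परीक्षण हो"]
hyps = ["नमस्ते संसार", "यो एक परिक्षण हो"]

wer = error_rate(refs, hyps, unit="word")
cer = error_rate(refs, hyps, unit="char")
print(f"WER: {wer:.2f}  CER: {cer:.2f}")
```

One misspelled word out of six makes the word-level rate much higher than the character-level rate, which is the same pattern as the gap between the card's `Wer` and `Cer`.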
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - training_steps: 5000
 - mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch  | Step | Cer     | Validation Loss | Wer     |
+|:-------------:|:------:|:----:|:-------:|:---------------:|:-------:|
+| 0.209         | 0.3597 | 300  | 10.9916 | 0.2021          | 41.1574 |
+| 0.1714        | 0.7194 | 600  | 8.9474  | 0.1755          | 35.7176 |
+| 0.101         | 1.0791 | 900  | 7.5565  | 0.1485          | 29.6991 |
+| 0.0902        | 1.4388 | 1200 | 7.2372  | 0.1396          | 28.2407 |
+| 0.0872        | 1.7986 | 1500 | 7.8024  | 0.1319          | 27.9861 |
+| 0.0453        | 2.1583 | 1800 | 6.3344  | 0.1374          | 26.2269 |
+| 0.0368        | 2.5180 | 2100 | 6.1766  | 0.1381          | 25.2315 |
+| 0.0472        | 2.8777 | 2400 | 5.8316  | 0.1316          | 24.1435 |
+| 0.0191        | 3.2374 | 2700 | 5.8059  | 0.1356          | 24.0278 |
+| 0.0185        | 3.5971 | 3000 | 5.5674  | 0.1376          | 23.125  |
+| 0.0182        | 3.9568 | 3300 | 5.5123  | 0.1360          | 23.0556 |
+| 0.0074        | 4.3165 | 3600 | 5.2077  | 0.1428          | 21.7130 |
+| 0.0086        | 4.6763 | 3900 | 5.1784  | 0.1433          | 21.2731 |
+| 0.0031        | 5.0360 | 4200 | 5.0279  | 0.1421          | 21.1806 |
+| 0.0024        | 5.3957 | 4500 | 4.9912  | 0.1482          | 20.7870 |
+| 0.0014        | 5.7554 | 4800 | 4.9839  | 0.1507          | 20.4861 |
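The `Epoch` and `Step` columns together imply how many optimizer steps make up one pass over the training set. The short calculation below recovers that number from three rows of the table. The examples-per-epoch figure additionally assumes that each step consumes exactly `train_batch_size` examples (one device, no gradient accumulation); the diff hides some hyperparameter lines, so that assumption may not hold.

```python
# (step, epoch) pairs copied from the training-results table.
rows = [(300, 0.3597), (2400, 2.8777), (4800, 5.7554)]

# The epoch column is rounded to four decimals, so round the ratio to an integer.
estimates = {round(step / epoch) for step, epoch in rows}
steps_per_epoch = estimates.pop() if len(estimates) == 1 else None

train_batch_size = 16  # new value in this diff

# ASSUMPTION: one step == one batch of train_batch_size examples.
examples_per_epoch = steps_per_epoch * train_batch_size

# training_steps is 5000 in the hyperparameter list.
total_epochs = 5000 / steps_per_epoch

print(steps_per_epoch, examples_per_epoch, round(total_epochs, 2))
```

All three rows agree on the same step count per epoch, so the 5000-step run covers just under six epochs.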
 ### Framework versions
 - Transformers 4.46.3

model-00001-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e6187e6a7d1fda04207aac48841966dc014cf786a85bf1d709fb62423a04fa32
 size 4993448880

 version https://git-lfs.github.com/spec/v1
+oid sha256:8cb4e3fe5f9c3908229be09f5cfb5d8c9140346bf5bffae7c0bf490c0659beff
 size 4993448880

model-00002-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:14b464ab5c81cc798bb0573fd7cba8beb478fe4d6dd0a64787680ab24d9885fa
 size 1180663192

 version https://git-lfs.github.com/spec/v1
+oid sha256:8054c2eb8b5999c0177a927da3e4bf7dbbbe35c218ffffe64c6f4fb29effe984
 size 1180663192
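The safetensors entries above are Git LFS pointer files: three `key value` lines giving the spec version, the content hash, and the byte size of the real file. In both shards only the `oid` changes while `size` stays the same, which fits retrained weights of identical shape. A minimal sketch for checking a downloaded shard against its pointer follows; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash algorithm, digest and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and hash recorded in the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as f:
        # Read in chunks so multi-gigabyte shards are never held in memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# New pointer for the first shard, copied from the diff above.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:8cb4e3fe5f9c3908229be09f5cfb5d8c9140346bf5bffae7c0bf490c0659beff
size 4993448880
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["algo"], pointer["size"])
```

With a local copy of the shard, `matches_pointer("model-00001-of-00002.safetensors", pointer)` would report whether the download is complete and uncorrupted.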