metadata
library_name: transformers
language:
  - ne
license: apache-2.0
base_model: openai/whisper-medium
tags:
  - generated_from_trainer
datasets:
  - kiranpantha/OpenSLR54-Whisper
metrics:
  - wer
model-index:
  - name: Whisper Medium Nepali - Kiran Pantha
    results:
      - task:
          name: Automatic Speech Recognition
          type: automatic-speech-recognition
        dataset:
          name: OpenSLR54
          type: kiranpantha/OpenSLR54-Whisper
          config: default
          split: test
          args: 'config: ne, split: test'
        metrics:
          - name: Wer
            type: wer
            value: 21.99074074074074

Whisper Medium Nepali - Kiran Pantha

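This model is a fine-tuned version of openai/whisper-medium for Nepali automatic speech recognition, trained on the OpenSLR54 dataset (kiranpantha/OpenSLR54-Whisper). At the final logged evaluation (step 4800) it achieves the following results on the evaluation set: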

  • Loss: 0.1608
  • Wer: 21.9907
  • Cer: 5.3068
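For readers unfamiliar with the metrics, the following is a minimal sketch of how word error rate (Wer) and character error rate (Cer) are conventionally defined: the edit distance between reference and hypothesis, divided by the reference length. It assumes the values reported above are percentages. The helper names `edit_distance` and `error_rate` and the sample sentences are illustrative only; this is not the evaluation script used for this model, and library implementations may normalize text differently.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences of tokens."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def error_rate(references, hypotheses, unit="word"):
    """Corpus-level error rate in percent; unit is 'word' or 'char'."""
    # Words are whitespace-separated; characters are Unicode code points
    # (spaces included), which is a simplification.
    split = (lambda s: s.split()) if unit == "word" else list
    errors = sum(
        edit_distance(split(r), split(h))
        for r, h in zip(references, hypotheses)
    )
    total = sum(len(split(r)) for r in references)
    return 100.0 * errors / total


# Made-up sentences for illustration, not taken from the dataset.
refs = ["म घर जान्छु"]
hyps = ["म घर जान्छ"]
print("WER:", round(error_rate(refs, hyps, "word"), 2))
print("CER:", round(error_rate(refs, hyps, "char"), 2))
```

Because a single wrong character makes the whole word count as an error, Wer is normally much higher than Cer on the same output, which matches the gap between the two figures above.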

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

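The card metadata lists kiranpantha/OpenSLR54-Whisper (config: default) as the dataset, with the reported metrics computed on its test split. More information is needed on the training split and preprocessing.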

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 1e-05
  • train_batch_size: 16
  • eval_batch_size: 8
  • seed: 42
  • optimizer: adamw_torch with betas=(0.9, 0.999) and epsilon=1e-08 (no additional optimizer arguments)
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 500
  • training_steps: 5000
  • mixed_precision_training: Native AMP
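To make the scheduler settings concrete, here is a minimal sketch of a linear schedule with warmup using the values listed above: the learning rate rises linearly from 0 to 1e-05 over the first 500 steps, then decays linearly to 0 at step 5000. The function name `linear_schedule_lr` is hypothetical, and the formula reflects the usual behaviour of a linear warmup/decay schedule rather than code taken from this training run.

```python
def linear_schedule_lr(step, base_lr=1e-5, warmup_steps=500, total_steps=5000):
    """Learning rate at a given optimizer step under linear warmup and decay."""
    if step < warmup_steps:
        # Warmup: scale linearly from 0 up to base_lr.
        return base_lr * step / warmup_steps
    # Decay: scale linearly from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / (total_steps - warmup_steps)


for step in (0, 250, 500, 2750, 5000):
    print(step, linear_schedule_lr(step))
```

Under these settings the peak learning rate is reached at step 500, partway through the first epoch.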

Training results

Training Loss  Epoch   Step  Cer      Validation Loss  Wer
0.2877         0.3597  300   13.1055  0.2588           48.6574
0.1865         0.7194  600   10.0741  0.2052           39.9537
0.1155         1.0791  900   8.1217   0.1633           31.8981
0.0992         1.4388  1200  7.6923   0.1577           30.6250
0.094          1.7986  1500  7.1969   0.1418           28.3796
0.0454         2.1583  1800  6.7308   0.1472           26.6898
0.0333         2.5180  2100  6.6353   0.1512           27.0602
0.0446         2.8777  2400  6.25     0.1409           25.5556
0.0204         3.2374  2700  6.5399   0.1513           25.8796
0.016          3.5971  3000  5.9674   0.1560           24.8380
0.0166         3.9568  3300  6.0592   0.1573           25.0926
0.0052         4.3165  3600  5.7802   0.1566           23.3796
0.0068         4.6763  3900  5.5233   0.1544           22.7083
0.0013         5.0360  4200  5.4756   0.1568           22.7546
0.001          5.3957  4500  5.3178   0.1606           22.0139
0.0017         5.7554  4800  5.3068   0.1608           21.9907
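The logged evaluations can be compared programmatically. The sketch below transcribes the training results into tuples and finds the step with the lowest validation loss and the step with the lowest Wer and Cer. It also estimates the number of optimizer steps per epoch from the Epoch and Step columns. The `Row` type and variable names are illustrative, and the training-set size estimate assumes a single device with no gradient accumulation, which the card does not state.

```python
from collections import namedtuple

Row = namedtuple("Row", "train_loss epoch step cer val_loss wer")

ROWS = [
    Row(0.2877, 0.3597, 300, 13.1055, 0.2588, 48.6574),
    Row(0.1865, 0.7194, 600, 10.0741, 0.2052, 39.9537),
    Row(0.1155, 1.0791, 900, 8.1217, 0.1633, 31.8981),
    Row(0.0992, 1.4388, 1200, 7.6923, 0.1577, 30.6250),
    Row(0.094, 1.7986, 1500, 7.1969, 0.1418, 28.3796),
    Row(0.0454, 2.1583, 1800, 6.7308, 0.1472, 26.6898),
    Row(0.0333, 2.5180, 2100, 6.6353, 0.1512, 27.0602),
    Row(0.0446, 2.8777, 2400, 6.25, 0.1409, 25.5556),
    Row(0.0204, 3.2374, 2700, 6.5399, 0.1513, 25.8796),
    Row(0.016, 3.5971, 3000, 5.9674, 0.1560, 24.8380),
    Row(0.0166, 3.9568, 3300, 6.0592, 0.1573, 25.0926),
    Row(0.0052, 4.3165, 3600, 5.7802, 0.1566, 23.3796),
    Row(0.0068, 4.6763, 3900, 5.5233, 0.1544, 22.7083),
    Row(0.0013, 5.0360, 4200, 5.4756, 0.1568, 22.7546),
    Row(0.001, 5.3957, 4500, 5.3178, 0.1606, 22.0139),
    Row(0.0017, 5.7554, 4800, 5.3068, 0.1608, 21.9907),
]

best_val_loss = min(ROWS, key=lambda r: r.val_loss)
best_wer = min(ROWS, key=lambda r: r.wer)
best_cer = min(ROWS, key=lambda r: r.cer)

# Steps per epoch inferred from the first logged row (step / epoch).
steps_per_epoch = round(ROWS[0].step / ROWS[0].epoch)
# Assumption: effective batch size equals train_batch_size (16).
approx_train_examples = steps_per_epoch * 16

print("lowest validation loss:", best_val_loss.step, best_val_loss.val_loss)
print("lowest Wer:", best_wer.step, best_wer.wer)
print("lowest Cer:", best_cer.step, best_cer.cer)
print("steps per epoch:", steps_per_epoch)
print("approx. training examples:", approx_train_examples)
```

Validation loss is lowest at step 2400, while Wer and Cer keep improving through step 4800, so the choice of checkpoint depends on which metric is used for selection.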

Framework versions

  • Transformers 4.46.3
  • PyTorch 2.5.1+cxx11.abi
  • Datasets 3.2.0
  • Tokenizers 0.20.3