sreyroth commited on
Commit
bdded93
·
verified ·
1 Parent(s): 85bb757

models/whisper-large-v3-cv17-th-ft-with-lr-1e-3

Browse files
README.md CHANGED
@@ -24,7 +24,7 @@ model-index:
24
  args: 'config: th, split: train'
25
  metrics:
26
  - type: wer
27
- value: 132.2683744748853
28
  name: Wer
29
  ---
30
 
@@ -35,9 +35,9 @@ should probably proofread and complete it, then remove this comment. -->
35
 
36
  This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the Common Voice 17.0 dataset.
37
  It achieves the following results on the evaluation set:
38
- - Loss: 3.1322
39
- - Cer: 114.9591
40
- - Wer: 132.2684
41
 
42
  ## Model description
43
 
@@ -56,25 +56,30 @@ More information needed
56
  ### Training hyperparameters
57
 
58
  The following hyperparameters were used during training:
59
- - learning_rate: 0.002
60
  - train_batch_size: 16
61
  - eval_batch_size: 16
62
  - seed: 42
63
  - optimizer: Use paged_adamw_8bit with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
64
  - lr_scheduler_type: linear
65
  - lr_scheduler_warmup_steps: 500
66
- - training_steps: 10000
67
  - mixed_precision_training: Native AMP
68
 
69
  ### Training results
70
 
71
- | Training Loss | Epoch | Step | Validation Loss | Cer | Wer |
72
- |:-------------:|:------:|:-----:|:---------------:|:--------:|:--------:|
73
- | 3.4988 | 1.0 | 2052 | 3.4636 | 150.2137 | 149.0381 |
74
- | 3.3477 | 2.0 | 4104 | 3.3319 | 166.3704 | 123.2261 |
75
- | 3.1868 | 3.0 | 6156 | 3.2054 | 122.7370 | 133.0710 |
76
- | 3.1487 | 4.0 | 8208 | 3.1660 | 155.4132 | 127.6764 |
77
- | 3.1219 | 4.8733 | 10000 | 3.1322 | 114.9591 | 132.2684 |
 
 
 
 
 
78
 
79
 
80
  ### Framework versions
 
24
  args: 'config: th, split: train'
25
  metrics:
26
  - type: wer
27
+ value: 77.07049551458215
28
  name: Wer
29
  ---
30
 
 
35
 
36
  This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the Common Voice 17.0 dataset.
37
  It achieves the following results on the evaluation set:
38
+ - Loss: 0.1953
39
+ - Cer: 49.5719
40
+ - Wer: 77.0705
41
 
42
  ## Model description
43
 
 
56
  ### Training hyperparameters
57
 
58
  The following hyperparameters were used during training:
59
+ - learning_rate: 0.0002
60
  - train_batch_size: 16
61
  - eval_batch_size: 16
62
  - seed: 42
63
  - optimizer: Use paged_adamw_8bit with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
64
  - lr_scheduler_type: linear
65
  - lr_scheduler_warmup_steps: 500
66
+ - num_epochs: 10
67
  - mixed_precision_training: Native AMP
68
 
69
  ### Training results
70
 
71
+ | Training Loss | Epoch | Step | Validation Loss | Cer | Wer |
72
+ |:-------------:|:-----:|:-----:|:---------------:|:--------:|:--------:|
73
+ | 0.158 | 1.0 | 2052 | 0.1361 | 288.0891 | 56.6543 |
74
+ | 0.1166 | 2.0 | 4104 | 0.1296 | 232.8857 | 57.0405 |
75
+ | 0.0739 | 3.0 | 6156 | 0.1298 | 107.5882 | 42.6799 |
76
+ | 0.0467 | 4.0 | 8208 | 0.1370 | 61.4281 | 43.2440 |
77
+ | 0.0263 | 5.0 | 10260 | 0.1424 | 69.8588 | 80.9373 |
78
+ | 0.012 | 6.0 | 12312 | 0.1607 | 56.1242 | 68.8006 |
79
+ | 0.0081 | 7.0 | 14364 | 0.1649 | 130.1459 | 76.1770 |
80
+ | 0.0036 | 8.0 | 16416 | 0.1782 | 110.6614 | 116.6100 |
81
+ | 0.002 | 9.0 | 18468 | 0.1862 | 50.2801 | 85.8464 |
82
+ | 0.0005 | 10.0 | 20520 | 0.1953 | 49.5719 | 77.0705 |
83
 
84
 
85
  ### Framework versions
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7b1bd9481d549b569c190cb711f26bbab2dbb20a7bea433bd45011ea0d43c3e3
3
  size 251714264
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:73588fc31a3f3b7850f5882ea383672d0ffa936a93e0d0035ec14d3dc00481c5
3
  size 251714264
runs/Feb26_14-30-05_gpt7/events.out.tfevents.1740555006.gpt7.338639.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dc8a91e09144b47c86dabb66aa37667935f1535c90ae83caef425b8a139ff026
3
- size 96554
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0ef743baed8b2f5d67d2933793c315e03e224c95d76c412e221867f82402b238
3
+ size 97286