nurzhanit commited on
Commit
2c372a2
·
verified ·
1 Parent(s): b845601

End of training

Browse files
Files changed (3) hide show
  1. README.md +88 -0
  2. generation_config.json +14 -0
  3. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,88 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - hi
4
+ base_model: nurzhanit/whisper-enhanced-ml
5
+ tags:
6
+ - hf-asr-leaderboard
7
+ - generated_from_trainer
8
+ datasets:
9
+ - mozilla-foundation/common_voice_11_0
10
+ metrics:
11
+ - wer
12
+ model-index:
13
+ - name: Whisper Small Hi - Sanchit Gandhi
14
+ results:
15
+ - task:
16
+ name: Automatic Speech Recognition
17
+ type: automatic-speech-recognition
18
+ dataset:
19
+ name: Common Voice 11.0
20
+ type: mozilla-foundation/common_voice_11_0
21
+ config: default
22
+ split: None
23
+ args: 'config: hi, split: test'
24
+ metrics:
25
+ - name: Wer
26
+ type: wer
27
+ value: 0.05155525004296271
28
+ ---
29
+
30
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
31
+ should probably proofread and complete it, then remove this comment. -->
32
+
33
+ # Whisper Small Hi - Sanchit Gandhi
34
+
35
+ This model is a fine-tuned version of [nurzhanit/whisper-enhanced-ml](https://huggingface.co/nurzhanit/whisper-enhanced-ml) on the Common Voice 11.0 dataset.
36
+ It achieves the following results on the evaluation set:
37
+ - Loss: 0.0005
38
+ - Wer: 0.0516
39
+
40
+ ## Model description
41
+
42
+ More information needed
43
+
44
+ ## Intended uses & limitations
45
+
46
+ More information needed
47
+
48
+ ## Training and evaluation data
49
+
50
+ More information needed
51
+
52
+ ## Training procedure
53
+
54
+ ### Training hyperparameters
55
+
56
+ The following hyperparameters were used during training:
57
+ - learning_rate: 1e-05
58
+ - train_batch_size: 16
59
+ - eval_batch_size: 8
60
+ - seed: 42
61
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
+ - lr_scheduler_type: linear
63
+ - lr_scheduler_warmup_steps: 25
64
+ - training_steps: 500
65
+ - mixed_precision_training: Native AMP
66
+
67
+ ### Training results
68
+
69
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
70
+ |:-------------:|:-----:|:----:|:---------------:|:------:|
71
+ | 0.0929 | 1.0 | 50 | 0.0404 | 8.1114 |
72
+ | 0.0385 | 2.0 | 100 | 0.0161 | 3.0761 |
73
+ | 0.0176 | 3.0 | 150 | 0.0081 | 1.4779 |
74
+ | 0.0087 | 4.0 | 200 | 0.0034 | 0.6358 |
75
+ | 0.0038 | 5.0 | 250 | 0.0020 | 0.4296 |
76
+ | 0.0025 | 6.0 | 300 | 0.0011 | 0.1203 |
77
+ | 0.0017 | 7.0 | 350 | 0.0008 | 0.0516 |
78
+ | 0.0009 | 8.0 | 400 | 0.0006 | 0.0516 |
79
+ | 0.0007 | 9.0 | 450 | 0.0006 | 0.0516 |
80
+ | 0.0007 | 10.0 | 500 | 0.0005 | 0.0516 |
81
+
82
+
83
+ ### Framework versions
84
+
85
+ - Transformers 4.40.0
86
+ - Pytorch 2.5.0+cu124
87
+ - Datasets 3.0.2
88
+ - Tokenizers 0.19.1
generation_config.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "begin_suppress_tokens": [
3
+ 220,
4
+ 50257
5
+ ],
6
+ "bos_token_id": 50257,
7
+ "decoder_start_token_id": 50258,
8
+ "eos_token_id": 50257,
9
+ "max_length": 448,
10
+ "pad_token_id": 50257,
11
+ "return_timestamps": false,
12
+ "suppress_tokens": [],
13
+ "transformers_version": "4.40.0"
14
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:92ec0c4f6136f8a8c4049ca1bb1364f97908f1cf50f586819d8d05d3ce6074b9
3
  size 966995080
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:208d07ff37ba63b2a98dbb51bd71f5b597c627abcbeeec308cf8a0dade91978e
3
  size 966995080