UmarRamzan committed on
Commit 04687ec · verified · 1 Parent(s): 09d93c1

End of training
README.md CHANGED
@@ -1,8 +1,6 @@
 ---
-license: mit
 tags:
 - generated_from_trainer
-base_model: facebook/w2v-bert-2.0
 datasets:
 - common_voice_17_0
 metrics:
@@ -11,8 +9,8 @@ model-index:
 - name: w2v2-bert-urdu
   results:
   - task:
-      type: automatic-speech-recognition
       name: Automatic Speech Recognition
+      type: automatic-speech-recognition
     dataset:
       name: common_voice_17_0
       type: common_voice_17_0
@@ -20,9 +18,9 @@ model-index:
       split: test[:100]
       args: ur
     metrics:
-    - type: wer
-      value: 0.6273224043715847
-      name: Wer
+    - name: Wer
+      type: wer
+      value: 0.3300546448087432
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,10 +28,10 @@ should probably proofread and complete it, then remove this comment. -->
 
 # w2v2-bert-urdu
 
-This model is a fine-tuned version of [facebook/w2v-bert-2.0](https://huggingface.co/facebook/w2v-bert-2.0) on the common_voice_17_0 dataset.
+This model was trained from scratch on the common_voice_17_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.1498
-- Wer: 0.6273
+- Loss: 0.4246
+- Wer: 0.3301
 
 ## Model description
 
@@ -52,7 +50,7 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
+- learning_rate: 5e-06
 - train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
@@ -60,7 +58,7 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 500
+- lr_scheduler_warmup_steps: 100
 - num_epochs: 2
 - mixed_precision_training: Native AMP
 
@@ -68,17 +66,17 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch  | Step | Validation Loss | Wer    |
 |:-------------:|:------:|:----:|:---------------:|:------:|
-| 3.5968        | 0.1695 | 50   | 3.1737          | 1.0    |
-| 3.1414        | 0.3390 | 100  | 2.9666          | 1.0    |
-| 2.3694        | 0.5085 | 150  | 1.0788          | 0.6525 |
-| 0.7692        | 0.6780 | 200  | 0.5647          | 0.4186 |
-| 0.5488        | 0.8475 | 250  | 0.4491          | 0.3486 |
-| 0.5568        | 1.0169 | 300  | 0.5883          | 0.7388 |
-| 0.7925        | 1.1864 | 350  | 1.0338          | 0.7967 |
-| 1.4791        | 1.3559 | 400  | 1.1474          | 0.6251 |
-| 1.2758        | 1.5254 | 450  | 1.1359          | 0.6251 |
-| 1.2763        | 1.6949 | 500  | 1.1497          | 0.6273 |
-| 1.2789        | 1.8644 | 550  | 1.1498          | 0.6273 |
+| 0.8145        | 0.1695 | 50   | 0.4620          | 0.3421 |
+| 0.4364        | 0.3390 | 100  | 0.3969          | 0.2874 |
+| 0.418         | 0.5085 | 150  | 0.3697          | 0.2820 |
+| 0.402         | 0.6780 | 200  | 0.3627          | 0.2842 |
+| 0.3698        | 0.8475 | 250  | 0.3314          | 0.2710 |
+| 0.3779        | 1.0169 | 300  | 0.3292          | 0.2852 |
+| 0.3167        | 1.1864 | 350  | 0.3230          | 0.2820 |
+| 0.3578        | 1.3559 | 400  | 0.3825          | 0.2940 |
+| 0.4189        | 1.5254 | 450  | 0.4225          | 0.3104 |
+| 0.4803        | 1.6949 | 500  | 0.4248          | 0.3311 |
+| 0.4612        | 1.8644 | 550  | 0.4246          | 0.3301 |
 
 
 ### Framework versions
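The hyperparameters above specify `lr_scheduler_type: linear` with warmup steps. A minimal sketch of that schedule shape (linear ramp to the base rate, then linear decay to zero) — this mirrors the behavior of the Trainer's linear scheduler but is not its actual implementation, and the total step count used below is illustrative, inferred from the training log ending near step 550 of 2 epochs:

```python
def linear_schedule_lr(step: int, base_lr: float, warmup_steps: int, total_steps: int) -> float:
    """Learning rate at `step` for a linear-warmup, linear-decay schedule."""
    if step < warmup_steps:
        # Ramp linearly from 0 up to base_lr over the warmup steps.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr down to 0 at total_steps.
    return base_lr * max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))

# Values from the card: learning_rate 5e-06, lr_scheduler_warmup_steps 100.
# total_steps=590 is an illustrative assumption, not from the card.
print(linear_schedule_lr(50, 5e-6, 100, 590))   # halfway through warmup
print(linear_schedule_lr(100, 5e-6, 100, 590))  # peak learning rate
```

With a 5e-06 peak, warmup reaches the full rate at step 100 and the rate then shrinks toward zero as training approaches its final step.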
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e417117fa0048ba9cedbf7a4f24527ac201450dc1a3d80837e9c08ef610bc036
+oid sha256:17747e10300fac022a134b42d307ebdfd5b00ee0b2b350da2fd17ec67ff66fab
 size 2423081060
runs/May12_18-22-06_1d4a7576e075/events.out.tfevents.1715538284.1d4a7576e075.1900.1 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:77a6d64d4a665cc983545fb85583b1075f66a97d374248f688c6b8fe8ae4d4cc
-size 10131
+oid sha256:f86c39156283e6a1025f9d70bc4fed254fbc60859204ec04daff0b4e2f58dabb
+size 12072
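The README changes above report a `wer` (word error rate) metric. Conceptually, WER is the word-level edit distance between reference and hypothesis transcripts divided by the reference length; a minimal self-contained sketch (not the `evaluate`/Trainer implementation used to produce the card's numbers):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level Levenshtein distance / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,         # deletion
                           dp[i][j - 1] + 1,         # insertion
                           dp[i - 1][j - 1] + cost)  # substitution
    return dp[len(ref)][len(hyp)] / len(ref)

print(wer("the cat sat", "the cat sat"))  # 0.0
print(wer("the cat sat", "the bat sat"))  # one substitution in three words
```

A WER of 0.3301, as in the updated card, means roughly one word-level error for every three reference words on the `test[:100]` split; note WER can exceed 1.0 when the hypothesis contains many insertions.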