AdaCodruta
/

wav2vec2_common_voice17_finetuning

@@ -1,4 +1,5 @@
 ---
 license: apache-2.0
 base_model: facebook/wav2vec2-base
 tags:
@@ -8,32 +9,32 @@ datasets:
 metrics:
 - wer
 model-index:
-- name: wav2vec2-romanian-test
   results:
   - task:
-      type: automatic-speech-recognition
       name: Automatic Speech Recognition
     dataset:
       name: common_voice_17_0
       type: common_voice_17_0
       config: ro
-      split: validation
       args: ro
     metrics:
-    - type: wer
-      value: 0.9992270033496522
-      name: Wer
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# wav2vec2-romanian-test
 This model is a fine-tuned version of [facebook/wav2vec2-base](https://huggingface.co/facebook/wav2vec2-base) on the common_voice_17_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7702
-- Wer: 0.9992
 ## Model description
@@ -56,7 +57,7 @@ The following hyperparameters were used during training:
 - train_batch_size: 32
 - eval_batch_size: 8
 - seed: 42
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
 - training_steps: 5000
@@ -66,16 +67,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch   | Step | Validation Loss | Wer    |
 |:-------------:|:-------:|:----:|:---------------:|:------:|
-| 0.4402        | 6.2112  | 1000 | 0.8085          | 0.9997 |
-| 0.1779        | 12.4224 | 2000 | 0.8190          | 0.9992 |
-| 0.1242        | 18.6335 | 3000 | 0.7426          | 0.9990 |
-| 0.1003        | 24.8447 | 4000 | 0.7451          | 0.9990 |
-| 0.0862        | 31.0559 | 5000 | 0.7702          | 0.9992 |
 ### Framework versions
-- Transformers 4.40.2
-- Pytorch 2.5.1+cu118
-- Datasets 3.1.0
-- Tokenizers 0.19.1

 ---
+library_name: transformers
 license: apache-2.0
 base_model: facebook/wav2vec2-base
 tags:
 metrics:
 - wer
 model-index:
+- name: wav2vec2_common_voice17_finetuning
   results:
   - task:
       name: Automatic Speech Recognition
+      type: automatic-speech-recognition
     dataset:
       name: common_voice_17_0
       type: common_voice_17_0
       config: ro
+      split: test
       args: ro
     metrics:
+    - name: Wer
+      type: wer
+      value: 1.0
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# wav2vec2_common_voice17_finetuning
 This model is a fine-tuned version of [facebook/wav2vec2-base](https://huggingface.co/facebook/wav2vec2-base) on the common_voice_17_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4054
+- Wer: 1.0
 ## Model description
 - train_batch_size: 32
 - eval_batch_size: 8
 - seed: 42
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
 - training_steps: 5000
 | Training Loss | Epoch   | Step | Validation Loss | Wer    |
 |:-------------:|:-------:|:----:|:---------------:|:------:|
+| 0.6577        | 3.5461  | 1000 | 0.4788          | 0.9997 |
+| 0.2893        | 7.0922  | 2000 | 0.4086          | 1.0    |
+| 0.1997        | 10.6383 | 3000 | 0.4135          | 0.9997 |
+| 0.156         | 14.1844 | 4000 | 0.4051          | 0.9992 |
+| 0.138         | 17.7305 | 5000 | 0.4054          | 1.0    |
 ### Framework versions
+- Transformers 4.49.0
+- Pytorch 2.4.1+cu124
+- Datasets 2.21.0
+- Tokenizers 0.21.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ea070f7b75827e137dde471f08f0ab8bd83612b5aa65f2763824f5837b20d923
 size 377654192

 version https://git-lfs.github.com/spec/v1
+oid sha256:1e51717ae660d96760858d4f1145c2dc68f1a6722f0bc48389493bb16528a469
 size 377654192