Lakoc commited on
Commit
8159163
·
verified ·
1 Parent(s): ecae0aa

End of training

Browse files
.gitattributes CHANGED
@@ -34,3 +34,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  predictions_common_voice_13_en_common_voice_13_en_test_wer22.42_hyp.trn.snt.utt filter=lfs diff=lfs merge=lfs -text
 
 
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  predictions_common_voice_13_en_common_voice_13_en_test_wer22.42_hyp.trn.snt.utt filter=lfs diff=lfs merge=lfs -text
37
+ predictions_common_voice_13_en_common_voice_13_en_test_wer19.69_hyp.trn.snt.utt filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -17,16 +17,16 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [](https://huggingface.co/) on the common_voice_13_0 dataset.
19
  It achieves the following results on the evaluation set:
20
- - Cer: 0.0805
21
- - Deletions: 3333
22
- - Hits: 124879
23
- - Insertions: 3716
24
- - Loss: 1.1152
25
- - Mer: 0.1779
26
- - Substitutions: 19969
27
- - Wer: 0.1823
28
- - Wil: 0.2916
29
- - Wip: 0.7084
30
 
31
  ## Model description
32
 
@@ -56,35 +56,52 @@ The following hyperparameters were used during training:
56
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
57
  - lr_scheduler_type: linear
58
  - lr_scheduler_warmup_steps: 15000
59
- - num_epochs: 50.0
60
 
61
  ### Training results
62
 
63
- | Training Loss | Epoch | Step | Cer | Deletions | Hits | Insertions | Validation Loss | Mer | Substitutions | Wer | Wil | Wip |
64
- |:-------------:|:-----:|:-----:|:------:|:---------:|:------:|:----------:|:---------------:|:------:|:-------------:|:------:|:------:|:------:|
65
- | 1.7038 | 5.0 | 14885 | 0.1731 | 6463 | 103703 | 7394 | 1.5150 | 0.3334 | 38015 | 0.3501 | 0.5133 | 0.4867 |
66
- | 1.6051 | 6.0 | 17862 | 0.1512 | 6309 | 107805 | 6257 | 1.4257 | 0.3020 | 34067 | 0.3147 | 0.4705 | 0.5295 |
67
- | 1.5396 | 7.0 | 20839 | 0.1368 | 5262 | 110900 | 6029 | 1.3715 | 0.2809 | 32019 | 0.2923 | 0.4428 | 0.5572 |
68
- | 1.4436 | 8.0 | 23816 | 0.1299 | 5464 | 112866 | 5686 | 1.3285 | 0.2665 | 29851 | 0.2767 | 0.4207 | 0.5793 |
69
- | 1.4287 | 9.0 | 26793 | 0.1241 | 4793 | 114249 | 5680 | 1.3090 | 0.2575 | 29139 | 0.2673 | 0.4091 | 0.5909 |
70
- | 1.395 | 10.0 | 29770 | 0.1200 | 4698 | 115580 | 5505 | 1.2842 | 0.2479 | 27903 | 0.2572 | 0.3949 | 0.6051 |
71
- | 1.3668 | 11.0 | 32747 | 0.1111 | 4400 | 117281 | 4991 | 1.2510 | 0.2343 | 26500 | 0.2422 | 0.3761 | 0.6239 |
72
- | 1.3238 | 12.0 | 35724 | 0.1064 | 4560 | 117845 | 4500 | 1.2363 | 0.2282 | 25776 | 0.2351 | 0.3673 | 0.6327 |
73
- | 1.3133 | 13.0 | 38701 | 0.1028 | 4173 | 118971 | 4613 | 1.2215 | 0.2214 | 25037 | 0.2283 | 0.3573 | 0.6427 |
74
- | 1.2968 | 14.0 | 41678 | 0.0995 | 3937 | 119798 | 4466 | 1.2026 | 0.2152 | 24446 | 0.2217 | 0.3487 | 0.6513 |
75
- | 1.2783 | 15.0 | 44655 | 0.0974 | 4071 | 120231 | 4295 | 1.1939 | 0.2115 | 23879 | 0.2176 | 0.3427 | 0.6573 |
76
- | 1.2359 | 16.0 | 47632 | 0.0961 | 3946 | 120640 | 4313 | 1.1884 | 0.2089 | 23595 | 0.2150 | 0.3388 | 0.6612 |
77
- | 1.2543 | 17.0 | 50609 | 0.0939 | 3757 | 121623 | 4476 | 1.1743 | 0.2033 | 22801 | 0.2094 | 0.3296 | 0.6704 |
78
- | 1.2245 | 18.0 | 53586 | 0.0919 | 3981 | 121522 | 3944 | 1.1690 | 0.2012 | 22678 | 0.2065 | 0.3273 | 0.6727 |
79
- | 1.2 | 19.0 | 56563 | 0.0903 | 3819 | 122029 | 3995 | 1.1626 | 0.1981 | 22333 | 0.2034 | 0.3226 | 0.6774 |
80
- | 1.1964 | 20.0 | 59540 | 0.0916 | 3822 | 122170 | 4154 | 1.1598 | 0.1980 | 22189 | 0.2036 | 0.3218 | 0.6782 |
81
- | 1.1822 | 21.0 | 62517 | 0.0871 | 3630 | 122825 | 3981 | 1.1471 | 0.1928 | 21726 | 0.1980 | 0.3146 | 0.6854 |
82
- | 1.1758 | 22.0 | 65494 | 0.0862 | 3556 | 123114 | 3918 | 1.1413 | 0.1906 | 21511 | 0.1956 | 0.3114 | 0.6886 |
83
- | 1.1735 | 23.0 | 68471 | 0.0847 | 3431 | 123623 | 4013 | 1.1381 | 0.1877 | 21127 | 0.1928 | 0.3067 | 0.6933 |
84
- | 1.1556 | 24.0 | 71448 | 0.0839 | 3668 | 123854 | 3698 | 1.1282 | 0.1845 | 20659 | 0.1891 | 0.3015 | 0.6985 |
85
- | 1.1538 | 25.0 | 74425 | 0.0819 | 3475 | 124201 | 3716 | 1.1240 | 0.1823 | 20505 | 0.1869 | 0.2986 | 0.7014 |
86
- | 1.1078 | 26.0 | 77402 | 0.0819 | 3410 | 124426 | 3751 | 1.1259 | 0.1810 | 20345 | 0.1856 | 0.2965 | 0.7035 |
87
- | 1.1539 | 27.0 | 80379 | 0.0805 | 3333 | 124879 | 3716 | 1.1152 | 0.1779 | 19969 | 0.1823 | 0.2916 | 0.7084 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
88
 
89
 
90
  ### Framework versions
@@ -93,6 +110,3 @@ The following hyperparameters were used during training:
93
  - Pytorch 2.2.0+rocm5.6
94
  - Datasets 2.18.0
95
  - Tokenizers 0.15.2
96
-
97
- ### Wandb run
98
- https://wandb.ai/butspeechfit/decred_commonvoice_en/runs/ED_small_cv_v2_continue3
 
17
 
18
  This model is a fine-tuned version of [](https://huggingface.co/) on the common_voice_13_0 dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 1.0688
21
+ - Cer: 0.0677
22
+ - Wer: 0.1598
23
+ - Mer: 0.1565
24
+ - Wil: 0.2593
25
+ - Wip: 0.7407
26
+ - Hits: 127573
27
+ - Substitutions: 17637
28
+ - Deletions: 2971
29
+ - Insertions: 3069
30
 
31
  ## Model description
32
 
 
56
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
57
  - lr_scheduler_type: linear
58
  - lr_scheduler_warmup_steps: 15000
59
+ - num_epochs: 70.0
60
 
61
  ### Training results
62
 
63
+ | Training Loss | Epoch | Step | Cer | Deletions | Hits | Insertions | Validation Loss | Mer | Substitutions | Wer | Wil | Wip |
64
+ |:-------------:|:-----:|:------:|:------:|:---------:|:------:|:----------:|:---------------:|:------:|:-------------:|:------:|:------:|:------:|
65
+ | 1.7038 | 5.0 | 14885 | 0.1731 | 6463 | 103703 | 7394 | 1.5150 | 0.3334 | 38015 | 0.3501 | 0.5133 | 0.4867 |
66
+ | 1.6051 | 6.0 | 17862 | 0.1512 | 6309 | 107805 | 6257 | 1.4257 | 0.3020 | 34067 | 0.3147 | 0.4705 | 0.5295 |
67
+ | 1.5396 | 7.0 | 20839 | 0.1368 | 5262 | 110900 | 6029 | 1.3715 | 0.2809 | 32019 | 0.2923 | 0.4428 | 0.5572 |
68
+ | 1.4436 | 8.0 | 23816 | 0.1299 | 5464 | 112866 | 5686 | 1.3285 | 0.2665 | 29851 | 0.2767 | 0.4207 | 0.5793 |
69
+ | 1.4287 | 9.0 | 26793 | 0.1241 | 4793 | 114249 | 5680 | 1.3090 | 0.2575 | 29139 | 0.2673 | 0.4091 | 0.5909 |
70
+ | 1.395 | 10.0 | 29770 | 0.1200 | 4698 | 115580 | 5505 | 1.2842 | 0.2479 | 27903 | 0.2572 | 0.3949 | 0.6051 |
71
+ | 1.3668 | 11.0 | 32747 | 0.1111 | 4400 | 117281 | 4991 | 1.2510 | 0.2343 | 26500 | 0.2422 | 0.3761 | 0.6239 |
72
+ | 1.3238 | 12.0 | 35724 | 0.1064 | 4560 | 117845 | 4500 | 1.2363 | 0.2282 | 25776 | 0.2351 | 0.3673 | 0.6327 |
73
+ | 1.3133 | 13.0 | 38701 | 0.1028 | 4173 | 118971 | 4613 | 1.2215 | 0.2214 | 25037 | 0.2283 | 0.3573 | 0.6427 |
74
+ | 1.2968 | 14.0 | 41678 | 0.0995 | 3937 | 119798 | 4466 | 1.2026 | 0.2152 | 24446 | 0.2217 | 0.3487 | 0.6513 |
75
+ | 1.2783 | 15.0 | 44655 | 0.0974 | 4071 | 120231 | 4295 | 1.1939 | 0.2115 | 23879 | 0.2176 | 0.3427 | 0.6573 |
76
+ | 1.2359 | 16.0 | 47632 | 0.0961 | 3946 | 120640 | 4313 | 1.1884 | 0.2089 | 23595 | 0.2150 | 0.3388 | 0.6612 |
77
+ | 1.2543 | 17.0 | 50609 | 0.0939 | 3757 | 121623 | 4476 | 1.1743 | 0.2033 | 22801 | 0.2094 | 0.3296 | 0.6704 |
78
+ | 1.2245 | 18.0 | 53586 | 0.0919 | 3981 | 121522 | 3944 | 1.1690 | 0.2012 | 22678 | 0.2065 | 0.3273 | 0.6727 |
79
+ | 1.2 | 19.0 | 56563 | 0.0903 | 3819 | 122029 | 3995 | 1.1626 | 0.1981 | 22333 | 0.2034 | 0.3226 | 0.6774 |
80
+ | 1.1964 | 20.0 | 59540 | 0.0916 | 3822 | 122170 | 4154 | 1.1598 | 0.1980 | 22189 | 0.2036 | 0.3218 | 0.6782 |
81
+ | 1.1822 | 21.0 | 62517 | 0.0871 | 3630 | 122825 | 3981 | 1.1471 | 0.1928 | 21726 | 0.1980 | 0.3146 | 0.6854 |
82
+ | 1.1758 | 22.0 | 65494 | 0.0862 | 3556 | 123114 | 3918 | 1.1413 | 0.1906 | 21511 | 0.1956 | 0.3114 | 0.6886 |
83
+ | 1.1735 | 23.0 | 68471 | 0.0847 | 3431 | 123623 | 4013 | 1.1381 | 0.1877 | 21127 | 0.1928 | 0.3067 | 0.6933 |
84
+ | 1.1556 | 24.0 | 71448 | 0.0839 | 3668 | 123854 | 3698 | 1.1282 | 0.1845 | 20659 | 0.1891 | 0.3015 | 0.6985 |
85
+ | 1.1538 | 25.0 | 74425 | 0.0819 | 3475 | 124201 | 3716 | 1.1240 | 0.1823 | 20505 | 0.1869 | 0.2986 | 0.7014 |
86
+ | 1.1078 | 26.0 | 77402 | 0.0819 | 3410 | 124426 | 3751 | 1.1259 | 0.1810 | 20345 | 0.1856 | 0.2965 | 0.7035 |
87
+ | 1.1539 | 27.0 | 80379 | 0.0805 | 3333 | 124879 | 3716 | 1.1152 | 0.1779 | 19969 | 0.1823 | 0.2916 | 0.7084 |
88
+ | 1.1432 | 54.0 | 80406 | 1.1113 | 0.0787 | 0.1790 | 0.1747 | 0.2868 | 0.7132 | 125277 | 19604 | 3300 | 3619 |
89
+ | 1.1171 | 55.0 | 81895 | 1.0912 | 0.0744 | 0.1713 | 0.1676 | 0.2763 | 0.7237 | 126048 | 18870 | 3263 | 3245 |
90
+ | 1.1027 | 56.0 | 83384 | 1.0874 | 0.0740 | 0.1696 | 0.1659 | 0.2736 | 0.7264 | 126362 | 18663 | 3156 | 3309 |
91
+ | 1.0827 | 57.0 | 84873 | 1.0865 | 0.0725 | 0.1690 | 0.1654 | 0.2728 | 0.7272 | 126356 | 18599 | 3226 | 3214 |
92
+ | 1.0794 | 58.0 | 86362 | 1.0837 | 0.0717 | 0.1665 | 0.1629 | 0.2691 | 0.7309 | 126790 | 18361 | 3030 | 3287 |
93
+ | 1.0585 | 59.0 | 87851 | 1.0816 | 0.0710 | 0.1664 | 0.1629 | 0.2688 | 0.7312 | 126738 | 18285 | 3158 | 3218 |
94
+ | 1.0549 | 60.0 | 89340 | 1.0785 | 0.0707 | 0.1651 | 0.1616 | 0.2671 | 0.7329 | 126913 | 18198 | 3070 | 3195 |
95
+ | 1.0708 | 61.0 | 90829 | 1.0795 | 0.0704 | 0.1649 | 0.1614 | 0.2667 | 0.7333 | 126928 | 18157 | 3096 | 3178 |
96
+ | 1.0674 | 62.0 | 92318 | 1.0767 | 0.0699 | 0.1638 | 0.1605 | 0.2650 | 0.7350 | 126981 | 17994 | 3206 | 3071 |
97
+ | 1.0709 | 63.0 | 93807 | 1.0738 | 0.0699 | 0.1638 | 0.1605 | 0.2652 | 0.7348 | 126999 | 18030 | 3152 | 3096 |
98
+ | 1.0672 | 64.0 | 95296 | 1.0734 | 0.0687 | 0.1622 | 0.1588 | 0.2630 | 0.7370 | 127257 | 17925 | 2999 | 3105 |
99
+ | 1.0716 | 65.0 | 96785 | 1.0712 | 0.0685 | 0.1610 | 0.1577 | 0.2613 | 0.7387 | 127412 | 17804 | 2965 | 3082 |
100
+ | 1.0664 | 66.0 | 98274 | 1.0723 | 0.0686 | 0.1613 | 0.1581 | 0.2618 | 0.7382 | 127312 | 17817 | 3052 | 3039 |
101
+ | 1.0452 | 67.0 | 99763 | 1.0703 | 0.0681 | 0.1605 | 0.1572 | 0.2605 | 0.7395 | 127444 | 17742 | 2995 | 3041 |
102
+ | 1.0318 | 68.0 | 101252 | 1.0695 | 0.0679 | 0.1603 | 0.1571 | 0.2601 | 0.7399 | 127479 | 17693 | 3009 | 3049 |
103
+ | 1.0341 | 69.0 | 102741 | 1.0686 | 0.0677 | 0.1597 | 0.1565 | 0.2590 | 0.7410 | 127600 | 17599 | 2982 | 3088 |
104
+ | 1.0338 | 70.0 | 104230 | 1.0688 | 0.0677 | 0.1598 | 0.1565 | 0.2593 | 0.7407 | 127573 | 17637 | 2971 | 3069 |
105
 
106
 
107
  ### Framework versions
 
110
  - Pytorch 2.2.0+rocm5.6
111
  - Datasets 2.18.0
112
  - Tokenizers 0.15.2
 
 
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ec9aa716122a67ba716a0cc098837038e2f1c645a2257b3ed26bf4009ed8a086
3
  size 143219080
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:73cce251b5893068e4230d7315a13d3c567482662f407dfe7c16d5f7a93d10d1
3
  size 143219080
predictions_common_voice_13_en_common_voice_13_en_test_wer19.69.csv ADDED
The diff for this file is too large to render. See raw diff
 
predictions_common_voice_13_en_common_voice_13_en_test_wer19.69_hyp.trn ADDED
The diff for this file is too large to render. See raw diff
 
predictions_common_voice_13_en_common_voice_13_en_test_wer19.69_hyp.trn.dtl ADDED
The diff for this file is too large to render. See raw diff
 
predictions_common_voice_13_en_common_voice_13_en_test_wer19.69_hyp.trn.snt.utt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2b90738aba0613777cd581715a99d06954ffe516bfbd7b30e39acd55b0197c44
3
+ size 11444711
predictions_common_voice_13_en_common_voice_13_en_test_wer19.69_hyp.trn.sys ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+
3
+
4
+ SYSTEM SUMMARY PERCENTAGES by SPEAKER
5
+
6
+ ,------------------------------------------------------------------------------------------------------------------------------------------------------------------------.
7
+ |/scratch/project_465000836/ipoloka/huggingface_asr/experiments/decred/commonvoice/ED_small_cv_v2/predictions_common_voice_13_en_common_voice_13_en_test_wer19.69_hyp.trn|
8
+ |------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
9
+ | SPKR | # Snt # Wrd | Corr Sub Del Ins Err S.Err |
10
+ |-------------------+----------------------------------------+-----------------------------------------------------------------------------------------------------------|
11
+ | utt | 16384 144172 | 83.0 14.3 2.7 2.7 19.7 63.7 |
12
+ |========================================================================================================================================================================|
13
+ | Sum/Avg | 16384 144172 | 83.0 14.3 2.7 2.7 19.7 63.7 |
14
+ |========================================================================================================================================================================|
15
+ | Mean | 16384.0 144172.0 | 83.0 14.3 2.7 2.7 19.7 63.7 |
16
+ | S.D. | 0.0 0.0 | 0.0 0.0 0.0 0.0 0.0 0.0 |
17
+ | Median | 16384.0 144172.0 | 83.0 14.3 2.7 2.7 19.7 63.7 |
18
+ `------------------------------------------------------------------------------------------------------------------------------------------------------------------------'
predictions_common_voice_13_en_common_voice_13_en_test_wer19.69_ref.trn ADDED
The diff for this file is too large to render. See raw diff
 
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c8ce3e41c814767e6cf45a76302247ed0c308f558534b18d8ec51d5bbcd57c9f
3
  size 5816
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7b13090ea7d998d9981889aa3b084242294953295b5c379131f4d9ec2fe35c5e
3
  size 5816