Update model card.
Browse files
README.md
CHANGED
@@ -16,11 +16,17 @@ widget:
|
|
16 |
|
17 |
# wav2vec2-xls-r-parlaspeech-hr
|
18 |
|
19 |
-
This model for Croatian ASR is based on the [facebook/wav2vec2-xls-r-300m model](https://huggingface.co/facebook/wav2vec2-xls-r-300m) and was fine-tuned with
|
20 |
|
21 |
The efforts resulting in this model were coordinated by Nikola Ljubešić, the rough manual data alignment was performed by Ivo-Pavao Jazbec, the method for fine automatic data alignment from [Plüss et al.](https://arxiv.org/abs/2010.02810) was applied by Vuk Batanović and Lenka Bajčetić, the transcripts were normalised by Danijel Korzinek, while the final modelling was performed by Peter Rupnik.
|
22 |
|
23 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
24 |
|
25 |
## Usage in `transformers`
|
26 |
|
@@ -58,6 +64,8 @@ transcription = processor.decode(predicted_ids[0]).lower()
|
|
58 |
# transcription: 'veliki broj poslovnih subjekata posluje sa minusom velik dio'
|
59 |
```
|
60 |
|
|
|
|
|
61 |
## Training hyperparameters
|
62 |
|
63 |
In fine-tuning, the following arguments were used:
|
|
|
16 |
|
17 |
# wav2vec2-xls-r-parlaspeech-hr
|
18 |
|
19 |
+
This model for Croatian ASR is based on the [facebook/wav2vec2-xls-r-300m model](https://huggingface.co/facebook/wav2vec2-xls-r-300m) and was fine-tuned with 300 hours of recordings and transcripts from the Croatian parliament available [here](https://www.clarin.si/repository/xmlui/handle/11356/1494).
|
20 |
|
21 |
The efforts resulting in this model were coordinated by Nikola Ljubešić, the rough manual data alignment was performed by Ivo-Pavao Jazbec, the method for fine automatic data alignment from [Plüss et al.](https://arxiv.org/abs/2010.02810) was applied by Vuk Batanović and Lenka Bajčetić, the transcripts were normalised by Danijel Korzinek, while the final modelling was performed by Peter Rupnik.
|
22 |
|
23 |
+
## Metrics
|
24 |
+
|
25 |
+
|split|CER|WER|
|
26 |
+
|---|---|---|
|
27 |
+
|dev|0.0335|0.1046|
|
28 |
+
|test|0.0234|0.0761|
|
29 |
+
|
30 |
|
31 |
## Usage in `transformers`
|
32 |
|
|
|
64 |
# transcription: 'veliki broj poslovnih subjekata posluje sa minusom velik dio'
|
65 |
```
|
66 |
|
67 |
+
|
68 |
+
|
69 |
## Training hyperparameters
|
70 |
|
71 |
In fine-tuning, the following arguments were used:
|