Update README.md
Browse files
README.md
CHANGED
@@ -20,12 +20,12 @@ pipeline_tag: automatic-speech-recognition
|
|
20 |
|
21 |
## Model Details
|
22 |
|
23 |
-
This system is a collection of three fine-tuned models
|
24 |
-
|
|
|
25 |
|
26 |
Each model is a fine-tuned version of [pyannote/segmentation-3.0](https://huggingface.co/pyannote/segmentation-3.0) on [the RTVE database](https://catedrartve.unizar.es/rtvedatabase.html) used for Albayzin Evaluations of IberSPEECH 2024.
|
27 |
|
28 |
-
|
29 |
On the RTVE2024 test set it achives the following results (two-decimal rounding):
|
30 |
|
31 |
- Diarization Error Rate (DER): 14.98%
|
|
|
20 |
|
21 |
## Model Details
|
22 |
|
23 |
+
This system is a collection of three fine-tuned models, to be fused with [DOVER-Lap](https://github.com/desh2608/dover-lap).
|
24 |
+
Each models is fine-tuned monitoring a different metric component of Diarization Error Rate (i.e., False Alarm, Missed Detection, and Speaker Confusion).
|
25 |
+
More information about the fusion of these models can be found in this [paper](https://www.isca-archive.org/iberspeech_2024/souganidis24_iberspeech.html).
|
26 |
|
27 |
Each model is a fine-tuned version of [pyannote/segmentation-3.0](https://huggingface.co/pyannote/segmentation-3.0) on [the RTVE database](https://catedrartve.unizar.es/rtvedatabase.html) used for Albayzin Evaluations of IberSPEECH 2024.
|
28 |
|
|
|
29 |
On the RTVE2024 test set it achives the following results (two-decimal rounding):
|
30 |
|
31 |
- Diarization Error Rate (DER): 14.98%
|