chsougan commited on
Commit
b061278
·
verified ·
1 Parent(s): fcb1619

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -20,12 +20,12 @@ pipeline_tag: automatic-speech-recognition
20
 
21
  ## Model Details
22
 
23
- This system is a collection of three fine-tuned models monitoring False Alarm, Missed Detection, and Speaker Confusion, to be fused with [DOVER-Lap](https://github.com/desh2608/dover-lap).
24
- More information about the fusion of these models can be found in this [paper](https://www.isca-archive.org/iberspeech_2024/souganidis24_iberspeech.html)
 
25
 
26
  Each model is a fine-tuned version of [pyannote/segmentation-3.0](https://huggingface.co/pyannote/segmentation-3.0) on [the RTVE database](https://catedrartve.unizar.es/rtvedatabase.html) used for Albayzin Evaluations of IberSPEECH 2024.
27
 
28
-
29
  On the RTVE2024 test set it achives the following results (two-decimal rounding):
30
 
31
  - Diarization Error Rate (DER): 14.98%
 
20
 
21
  ## Model Details
22
 
23
+ This system is a collection of three fine-tuned models, to be fused with [DOVER-Lap](https://github.com/desh2608/dover-lap).
24
+ Each models is fine-tuned monitoring a different metric component of Diarization Error Rate (i.e., False Alarm, Missed Detection, and Speaker Confusion).
25
+ More information about the fusion of these models can be found in this [paper](https://www.isca-archive.org/iberspeech_2024/souganidis24_iberspeech.html).
26
 
27
  Each model is a fine-tuned version of [pyannote/segmentation-3.0](https://huggingface.co/pyannote/segmentation-3.0) on [the RTVE database](https://catedrartve.unizar.es/rtvedatabase.html) used for Albayzin Evaluations of IberSPEECH 2024.
28
 
 
29
  On the RTVE2024 test set it achives the following results (two-decimal rounding):
30
 
31
  - Diarization Error Rate (DER): 14.98%