README.md · diarizers-community/README at 40c310cb0542c9b27a73d94a3172c5045b1e4916

metadata

title: README
emoji: 🏃
colorFrom: indigo
colorTo: purple
sdk: static
pinned: false

diarizers-community aims to promote speaker diarization on the Hugging Face hub. It comes with diarizers, a library for fine-tuning pyannote speaker diarzaition models that is compatible with the Hugging Face ecosystem.

This organization contains:

A collection of multilingual speaker diarization datasets that are compatible with diarizers. They have been processed using diarizers scripts.

The currently available datasets are the CallHome (Japanese, Chinese, German, Spanish, English), the AMI Corpus (English), Vox-Converse (English) and Simsamu (French). We aim at adding more datasets in the future to support speaker diarization on the Hub.

A collection of 5 fine-tuned segmentation model baselines that can be used in a pyannote speaker diarization pipeline.
Each model has been fine-tuned on a specific language of the Callhome dataset. In comparison to the pretrained pyannote segmentation model, they reach better performance on each of the Callhome test sets:

** ADD BENCHMARK **

Edit this README.md markdown file to author your organization card.