Spaces:
Running
Running
File size: 2,049 Bytes
04d7292 83407f3 59a0a12 83407f3 40c310c 6a9462c 40c310c c09d436 117475e c09d436 40c310c 59a0a12 6a9462c 117475e 6a9462c 117475e 6a9462c 59a0a12 9c4bb34 59a0a12 04d7292 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 |
---
title: README
emoji: π
colorFrom: indigo
colorTo: purple
sdk: static
pinned: false
---
[diarizers-community](https://huggingface.co/diarizers-community) aims to promote speaker diarization on the Hugging Face hub. It contains:
- A collection of [multilingual speaker diarization datasets](https://huggingface.co/collections/diarizers-community/speaker-diarization-datasets-66261b8d571552066e003788) that are compatible with the [diarizers](https://github.com/kamilakesbi/diarizers) library. They have been processed using [diarizers scripts](https://github.com/kamilakesbi/diarizers/blob/main/datasets/README.md).
The available datasets are the CallHome (Japanese, Chinese, German, Spanish, English), AMI Corpus (English), Vox-Converse (English) and Simsamu (French). We aim to add more datasets in the future to better support speaker diarising on the Hub.
- A collection of multilingual [fine-tuned segmentation model](https://huggingface.co/collections/diarizers-community/models-66261d0f9277b825c807ff2a) baselines compatible with pyannote.
Each model has been fine-tuned on a specific Callhome language subset. They achieve better performances on multilingual data compared to pyannote's pre-trained [segmentation-3.0](https://huggingface.co/pyannote/segmentation-3.0) model:
| First Header | Second Header |
| ------------- | ------------- |
| Content Cell | Content Cell |
| Content Cell | Content Cell |
Note: Results have been obtained using the [test script](https://github.com/kamilakesbi/diarizers/blob/main/test_segmentation.py) from diarizers.
Together with diarizers-community, we release:
- [diarizers](https://github.com/kamilakesbi/diarizers/tree/main), a library for fine-tuning pyannote speaker diarization models using the Hugging Face ecosystem.
- A google colab [notebook](https://colab.research.google.com/github/kamilakesbi/notebooks/blob/main/fine_tune_pyannote.ipynb), with a step-by-step guide on how to use diarizers.
Edit this `README.md` markdown file to author your organization card.
|