---
title: Denoise And Diarization
emoji: 🐠
colorFrom: gray
colorTo: gray
sdk: gradio
sdk_version: 3.28.0
app_file: app.py
pinned: false
---

# How to run inference:
1) [Hugging Face Space](https://huggingface.co/spaces/deepkotix/denoise_and_diarization)
2) [Telegram bot](https://t.me/diarizarion_bot)
3) Run inference locally:
   1) GUI:
   `python app.py`
   2) Command line:
   `python main_pipeline.py --audio-path dialog.mp3`

# About the pipeline:
+ denoise the audio
+ VAD (voice activity detection)
+ extract a speaker embedding from each VAD fragment
+ cluster these embeddings
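
The final clustering step can be illustrated with a minimal sketch. It assumes each VAD fragment has already been turned into a fixed-length embedding vector, and it uses a simple greedy threshold clustering on cosine similarity. This is an illustrative assumption, not necessarily the algorithm used in `main_pipeline.py`; the names `cosine_similarity`, `cluster_embeddings`, and the `threshold` value are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_embeddings(embeddings, threshold=0.7):
    """Greedily assign a speaker label to each fragment embedding.

    Hypothetical helper: a fragment joins the most similar existing
    speaker if the similarity reaches `threshold`, otherwise it starts
    a new speaker.
    """
    # Running sum of member embeddings per speaker. Cosine similarity
    # ignores scale, so the sum stands in for the mean direction.
    centroids = []
    labels = []
    for emb in embeddings:
        best_label, best_sim = -1, threshold
        for label, centroid in enumerate(centroids):
            sim = cosine_similarity(emb, centroid)
            if sim >= best_sim:
                best_label, best_sim = label, sim
        if best_label == -1:
            centroids.append(list(emb))
            labels.append(len(centroids) - 1)
        else:
            centroids[best_label] = [
                c + x for c, x in zip(centroids[best_label], emb)
            ]
            labels.append(best_label)
    return labels


# Toy 2-D embeddings: two fragments per speaker.
fragments = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
print(cluster_embeddings(fragments))  # prints [0, 0, 1, 1]
```

Real speaker embeddings have hundreds of dimensions, and a suitable threshold depends on the embedding model, so the value above is only a placeholder.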


# Inference time by hardware

| Hardware                   | Inference time for dialog.mp3 |
|----------------------------|:-----------------------------:|
| CPU (2 vCPU, Hugging Face) |          453.8 s/it           |
| GPU (Tesla V100)           |           8.23 s/it           |