metadata
title: Denoise And Diarization
emoji: 🐠
colorFrom: gray
colorTo: gray
sdk: gradio
sdk_version: 3.28.0
app_file: app.py
pinned: false

How to run inference:

  1. Hugging Face Space
  2. Telegram bot
  3. Local inference:
    1. GUI: python app.py
    2. Command line: python main_pipeline.py --audio-path dialog.mp3

About the pipeline:

  • denoise the audio
  • run VAD (voice activity detection)
  • extract a speaker embedding from each VAD fragment
  • cluster these embeddings
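
The last step can be illustrated with a minimal sketch. It is not the code from this repository: the clustering method used by the pipeline is not stated here, so the greedy cosine-similarity grouping, the function names `cosine_similarity` and `cluster_embeddings`, the `threshold` value, and the toy 2-D vectors are all assumptions chosen for illustration. Real speaker embeddings would have many more dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_embeddings(embeddings, threshold=0.8):
    """Greedy clustering (hypothetical stand-in for the real method).

    Each embedding joins the most similar existing cluster if the
    similarity to its centroid reaches `threshold`; otherwise it
    starts a new cluster. Returns one speaker label per fragment.
    """
    centroids = []  # running mean vector of each cluster
    counts = []     # number of members in each cluster
    labels = []
    for emb in embeddings:
        best_idx, best_sim = -1, threshold
        for idx, centroid in enumerate(centroids):
            sim = cosine_similarity(emb, centroid)
            if sim >= best_sim:
                best_idx, best_sim = idx, sim
        if best_idx == -1:
            # No cluster is similar enough: treat as a new speaker.
            centroids.append(list(emb))
            counts.append(1)
            labels.append(len(centroids) - 1)
        else:
            # Update the running mean of the matched cluster.
            n = counts[best_idx]
            centroids[best_idx] = [
                (c * n + x) / (n + 1)
                for c, x in zip(centroids[best_idx], emb)
            ]
            counts[best_idx] = n + 1
            labels.append(best_idx)
    return labels


# Toy embeddings, one per VAD fragment (made-up values).
fragments = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.95, 0.05], [0.2, 0.9]]
labels = cluster_embeddings(fragments)
print(labels)
```

Fragments that share a label are attributed to the same speaker. A greedy pass like this depends on the order of the fragments and on the threshold, which is why batch methods are often preferred in practice.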

Inference time by hardware

Inference time for the file dialog.mp3:
CPU (2 vCPU, Hugging Face): 453.8 s/it
GPU (Tesla V100): 8.23 s/it
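
For a rough sense of scale, the ratio of the two timings above can be computed directly. The variable names are illustrative, and the result only applies to this one file on these two machines.

```python
# Timings reported above for dialog.mp3, in seconds per iteration.
cpu_seconds = 453.8
gpu_seconds = 8.23

# How many times faster the GPU run was than the CPU run.
speedup = cpu_seconds / gpu_seconds
print(f"GPU speedup over CPU: {speedup:.1f}x")
```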