{
  "architectures": [
    "SegmentationModel"
  ],
  "chunk_duration": 10.0,
  "max_speakers_per_chunk": 3,
  "max_speakers_per_frame": 2,
  "min_duration": null,
  "model_type": "pyannet",
  "sample_rate": 16000,
  "torch_dtype": "float32",
  "transformers_version": "4.40.0",
  "warm_up": [
    0.0,
    0.0
  ],
  "weigh_by_cardinality": false
}
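The values above can be sanity-checked with a few lines of Python. The following is a minimal sketch, not part of the repository: it parses a copy of the config from an inline string (the variable names `CONFIG_TEXT`, `samples_per_chunk`, and `num_powerset_classes` are illustrative), derives the number of audio samples in one chunk, and, assuming the model uses a powerset encoding of speaker activity as pyannote segmentation models commonly do, counts the output classes implied by the two speaker limits.

```python
import json
from math import comb

# Inline copy of the config shown above, so the sketch runs without any file.
CONFIG_TEXT = """
{
  "architectures": ["SegmentationModel"],
  "chunk_duration": 10.0,
  "max_speakers_per_chunk": 3,
  "max_speakers_per_frame": 2,
  "min_duration": null,
  "model_type": "pyannet",
  "sample_rate": 16000,
  "torch_dtype": "float32",
  "transformers_version": "4.40.0",
  "warm_up": [0.0, 0.0],
  "weigh_by_cardinality": false
}
"""

config = json.loads(CONFIG_TEXT)

# Number of waveform samples in one chunk: duration (s) * sample rate (Hz).
samples_per_chunk = int(config["chunk_duration"] * config["sample_rate"])

# The per-frame speaker limit cannot sensibly exceed the per-chunk limit.
assert config["max_speakers_per_frame"] <= config["max_speakers_per_chunk"]

# ASSUMPTION: powerset encoding. Each class is a subset of the chunk's
# speakers with at most `max_speakers_per_frame` members, including the
# empty set (no one speaking).
num_powerset_classes = sum(
    comb(config["max_speakers_per_chunk"], k)
    for k in range(config["max_speakers_per_frame"] + 1)
)

print(f"samples per chunk: {samples_per_chunk}")
print(f"powerset classes (if applicable): {num_powerset_classes}")
```

With this config, a 10-second chunk at 16 kHz holds 160000 samples, and the limits of 3 speakers per chunk and 2 per frame give 1 + 3 + 3 = 7 subsets under the powerset assumption.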