metadata
language: en
tags:
- audio-classification
- causal-representation
- infant-cry-detection
license: mit
datasets:
- custom-audio-dataset
metrics:
- event-based-f1
- iou
- accuracy
Infant Cry Detection Using Causal Temporal Representation
This model detects infant cries using a novel causal temporal representation framework. By integrating causal reasoning into the data-generating process (DGP), the model aims to enhance the interpretability and reliability of cry detection systems.
Features
- Causal Data Generating Process: Incorporates mathematical causal assumptions to define the relationship between audio features and annotations.
- Supervised Models: Includes pre-trained state-of-the-art models:
- Bidirectional LSTM
- Transformer
- MobileNet V2
- Event-Based Metrics: Tailored for time-sensitive detection tasks:
- Event-based F1-score
- Intersection over Union (IOU)
- Interactive Example: Jupyter Notebook with step-by-step usage demonstrations.
How to Use
You can load the model directly from Hugging Face: