Note: This classifier also contains fine-tuned whisper-small weights in its state dict. It will be properly loaded by my model wrapper.

Result of the classifier Rob's human-annotated dataset (data/voicemail_human_eval.csv):

Results for chunk size 1 seconds:

  • Accuracy: 0.8080
  • Precision: 0.9353
  • Recall: 0.7692
  • F1 Score: 0.8442

Results for chunk size 2 seconds:

  • Accuracy: 0.8560
  • Precision: 0.9650
  • Recall: 0.8166
  • F1 Score: 0.8846

Results for chunk size 5 seconds:

  • Accuracy: 0.8640
  • Precision: 0.9856
  • Recall: 0.8107
  • F1 Score: 0.8896

Results for chunk size 10 seconds:

  • Accuracy: 0.8760
  • Precision: 1.0000
  • Recall: 0.8166
  • F1 Score: 0.8990

Results for full audio samples:

  • Accuracy: 0.8760
  • Precision: 1.0000
  • Recall: 0.8166
  • F1 Score: 0.8990
Downloads last month
68
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for SynthflowAI/whisper-small_voicemail_classification

Quantized
(18)
this model

Collection including SynthflowAI/whisper-small_voicemail_classification