dedgington
/

whisper_urban_sounds

Model card Files Files and versions Community

dedgington commited on Sep 5, 2024

Commit

bf688ca

·

verified ·

1 Parent(s): 659c2da

Update README.md

Files changed (1) hide show

README.md +10 -3

README.md CHANGED Viewed

@@ -1,3 +1,10 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+---
+ fine-tune the pre-trained OpenAI Whisper model for audio classification in PyTorch.
+Audio classification is an important task that can be applied in various scenarios, such as speech dialogue detection, sentiment analysis, music genre recognition, environmental sound identification, etc.
+OpenAI Whisper is an excellent model for audio classification that achieved state-of-the-art results on several benchmarks. It is based on the transformer architecture and uses self-attention to process audio inputs. OpenAI Whisper can recognize speech and audio from different languages, accents, and domains with high accuracy and robustness.
+classify various sounds by fine-tuning the OpenAI Whisper model from Hugging Face in the PyTorch deep learning library. load the pre-trained model, prepare a custom audio dataset, train the model on the dataset, and evaluate the model performance.