leeksang commited on
Commit
75052ff
ยท
verified ยท
1 Parent(s): 07ad250

Update readme.md

Browse files
Files changed (1) hide show
  1. readme.md +37 -22
readme.md CHANGED
@@ -1,22 +1,37 @@
1
- ---
2
- title: Accent Classifier
3
- emoji: "๐ŸŽ™๏ธ"
4
- colorFrom: indigo
5
- colorTo: pink
6
- sdk: gradio
7
- sdk_version: 5.32.0
8
- app_file: app.py
9
- pinned: false
10
- ---
11
-
12
- # ๐ŸŽ™๏ธ Accent Classifier App
13
-
14
- This Gradio-powered app allows you to paste a public video URL (YouTube, Vimeo, Dailymotion), download it with `yt-dlp`, extract the audio using `ffmpeg`, and classify the speaker identity (as a proxy for accent) using the `superb/wav2vec2-base-superb-sid` model from Hugging Face.
15
-
16
- ---
17
-
18
- ## ๐Ÿ”ง Setup
19
-
20
- ```bash
21
- pip install -r requirements.txt
22
- sudo apt install ffmpeg
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ title: Accent Classifier
3
+ emoji: ๐ŸŽ™๏ธ
4
+ colorFrom: teal
5
+ colorTo: cyan
6
+ sdk: gradio
7
+ sdk_version: "3.38.1"
8
+ app_file: app.py
9
+ pinned: false
10
+ ---
11
+
12
+ # Accent Classifier ๐ŸŽ™๏ธ
13
+
14
+ This app downloads a public YouTube or Vimeo video, extracts its audio, and classifies the speaker's accent (or rather, speaker ID as a proxy) using a Hugging Face model.
15
+
16
+ ### How it works
17
+
18
+ 1. You provide a video URL.
19
+ 2. The app downloads the audio using `yt-dlp`.
20
+ 3. It extracts the audio in a format suitable for the model (`wav`, 16kHz, mono).
21
+ 4. It runs the `superb/wav2vec2-base-superb-sid` model to classify the speaker.
22
+ 5. Displays the predicted speaker ID and confidence.
23
+
24
+ ### Requirements
25
+
26
+ - Python 3.8+
27
+ - `yt-dlp`
28
+ - `ffmpeg` installed on your system and accessible from the command line.
29
+ - `gradio` for the UI.
30
+ - `transformers` from Hugging Face.
31
+
32
+ ### Usage
33
+
34
+ Run the app:
35
+
36
+ ```bash
37
+ python app.py