Spaces:
Running
Running
Update description with note about reducing stereo audio to single channel files
Browse files
app.py
CHANGED
@@ -186,7 +186,8 @@ with gr.Blocks() as demo:
|
|
186 |
- [Chun Liang Chan](https://staff.wcas.northwestern.edu/clc500/)
|
187 |
|
188 |
## Requirements
|
189 |
-
- All speech files must be in .wav format. (Note: It is recommended to normalize the loudness of the files.)
|
|
|
190 |
- All speech files that are being compared must contain productions of the identical linguistic content (i.e., same words in same order).
|
191 |
- For example, the files may contain productions of a given sentence by different talkers, or by a single talker under different conditions.
|
192 |
- Note that while the utility will return distance values for files with different content the interpretation of these values is meaningless.
|
|
|
186 |
- [Chun Liang Chan](https://staff.wcas.northwestern.edu/clc500/)
|
187 |
|
188 |
## Requirements
|
189 |
+
- All speech files must be in a single channel .wav format. (Note: It is recommended to normalize the loudness of the files.)
|
190 |
+
- Stereo or multi channel audio files should be reduced to a single channel before processing. A [Praat](https://www.fon.hum.uva.nl/praat/) script that extracts a single channel from a directory of .wav files is available [here](https://huggingface.co/spaces/MLSpeech/perceptual-similarity/resolve/main/extractSingleChannel.praat).
|
191 |
- All speech files that are being compared must contain productions of the identical linguistic content (i.e., same words in same order).
|
192 |
- For example, the files may contain productions of a given sentence by different talkers, or by a single talker under different conditions.
|
193 |
- Note that while the utility will return distance values for files with different content the interpretation of these values is meaningless.
|