MLSpeech commited on
Commit
4164e22
·
verified ·
1 Parent(s): 5b87bb9

Update description with note about reducing stereo audio to single channel files

Browse files
Files changed (1) hide show
  1. app.py +2 -1
app.py CHANGED
@@ -186,7 +186,8 @@ with gr.Blocks() as demo:
186
  - [Chun Liang Chan](https://staff.wcas.northwestern.edu/clc500/)
187
 
188
  ## Requirements
189
- - All speech files must be in .wav format. (Note: It is recommended to normalize the loudness of the files.)
 
190
  - All speech files that are being compared must contain productions of the identical linguistic content (i.e., same words in same order).
191
  - For example, the files may contain productions of a given sentence by different talkers, or by a single talker under different conditions.
192
  - Note that while the utility will return distance values for files with different content the interpretation of these values is meaningless.
 
186
  - [Chun Liang Chan](https://staff.wcas.northwestern.edu/clc500/)
187
 
188
  ## Requirements
189
+ - All speech files must be in a single channel .wav format. (Note: It is recommended to normalize the loudness of the files.)
190
+ - Stereo or multi channel audio files should be reduced to a single channel before processing. A [Praat](https://www.fon.hum.uva.nl/praat/) script that extracts a single channel from a directory of .wav files is available [here](https://huggingface.co/spaces/MLSpeech/perceptual-similarity/resolve/main/extractSingleChannel.praat).
191
  - All speech files that are being compared must contain productions of the identical linguistic content (i.e., same words in same order).
192
  - For example, the files may contain productions of a given sentence by different talkers, or by a single talker under different conditions.
193
  - Note that while the utility will return distance values for files with different content the interpretation of these values is meaningless.