librosa==0.9.1 soundfile==0.10.3.post1 torch==1.11.0 transformers==4.18.0 speechbrain stt webrtcvad numpy ffmpeg-python soundfile==0.10.3.post1 wget aiofiles pydub git+https://github.com/NVIDIA/NeMo.git@r1.11.0#egg=nemo_toolkit[all] pyaudioconvert