numpy
scipy
scikit-learn
librosa
soundfile
pydub
torch==2.0.1
torchaudio==2.0.2
torchlibrosa==0.1.0
torchvision==0.15.2
transformers==4.27.4
einops
einops_exts
huggingface-hub
laion-clap==1.1.3