File size: 1,241 Bytes
4a7ac4e 17f2ed9 4a7ac4e 17f2ed9 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 |
---
license: mit
tags:
- dysarthric speech
- classification
- audio classification
---
# SSL4PR WavLM Base
This repository hosts the pre-trained SSL4PR WavLM Base models for Parkinson's Disease detection from speech in real-world operating conditions. These models are based on the work titled "Exploiting Foundation Models and Speech Enhancement for Parkinson’s Disease Detection from Speech in Real-World Operative Conditions" by Moreno La Quatra et al.
## Repository Link
[GitHub Repository](https://github.com/K-STMLab/SSL4PR/)
## Pre-trained Models
Pre-trained models are available on the Hugging Face model hub. To use the SSL4PR WavLM Base models, please clone the repository by running the following command:
```bash
git clone https://huggingface.co/morenolq/SSL4PR-wavlm-base
```
Ensure you have git lfs installed. Each repository contains the pre-trained models, one per fold, named `fold_1.pt`, `fold_2.pt`, ..., `fold_10.pt`.
The models are available in PyTorch format.
- [SSL4PR WavLM Base](https://huggingface.co/morenolq/SSL4PR-wavlm-base) - **this repository**
- [SSL4PR HuBERT Base](https://huggingface.co/morenolq/SSL4PR-hubert-base)
## Citation
```bibtex
Available upon INTERSPEECH 2024 proceedings.
``` |