|
--- |
|
license: mit |
|
datasets: |
|
- google/speech_commands |
|
base_model: |
|
- facebook/wav2vec2-base |
|
--- |
|
# Command Classifier |
|
|
|
This is a lightweight classifier model trained on top of Wav2Vec2 features for classifying speech commands. |
|
|
|
- **Base model**: `facebook/wav2vec2-base` |
|
- **Fine-tuned model**: Lightweight linear classifier |
|
- **Dataset**: [Google Speech Commands v0.02](https://arxiv.org/abs/1804.03209) |
|
|
|
## Usage |
|
|
|
You can use this classifier by combining it with Wav2Vec2 features. The classifier expects mean-pooled Wav2Vec2 hidden states. |
|
|
|
```python |
|
from transformers import Wav2Vec2Model |
|
from command_classifier import CommandClassifier |
|
import torch |
|
|
|
wav2vec = Wav2Vec2Model.from_pretrained("facebook/wav2vec2-base") |
|
classifier = CommandClassifier(num_classes=35) |
|
classifier.load_state_dict(torch.load("pytorch_model.bin")) |
|
|