Update README.md
README.md
CHANGED
@@ -12,7 +12,7 @@ Disclaimer: The team releasing Audio Spectrogram Transformer did not write a mod
 
 ## Model description
 
-The Audio Spectrogram Transformer is equivalent to [ViT](https://huggingface.co/docs/transformers/model_doc/
+The Audio Spectrogram Transformer is equivalent to [ViT](https://huggingface.co/docs/transformers/model_doc/vit), but applied on audio. Audio is first turned into an image (as a spectrogram), after which a Vision Transformer is applied. The model gets state-of-the-art results on several audio classification benchmarks.
 
 ## Usage
 
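
The pipeline described in the changed line (audio turned into a spectrogram image, then fed to a Vision Transformer) can be illustrated with a rough sketch. This is not the model's actual preprocessing: the function names `log_spectrogram` and `to_patches`, the frame length, hop size, and patch size are all assumptions chosen for illustration, and the real feature extractor and patching scheme may differ. The sketch uses only the Python standard library, with a naive DFT, to stay dependency-free.

```python
import cmath
import math


def log_spectrogram(samples, frame_len=64, hop=32):
    """Hypothetical helper: return a 2-D list (frames x frequency bins) of log-magnitudes."""
    # A Hann window reduces spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len) for n in range(frame_len)]
    n_bins = frame_len // 2 + 1  # non-negative frequencies of a real signal
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = [samples[start + n] * window[n] for n in range(frame_len)]
        row = []
        for k in range(n_bins):
            # Naive DFT for frequency bin k (slow, but fine for a small demo).
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n, x in enumerate(frame)
            )
            # Small offset avoids log(0) for empty bins.
            row.append(math.log(abs(coeff) + 1e-6))
        frames.append(row)
    return frames


def to_patches(image, patch=16):
    """Hypothetical helper: split a 2-D list into non-overlapping, flattened square patches."""
    rows = len(image) // patch * patch     # drop rows that do not fill a patch
    cols = len(image[0]) // patch * patch  # drop columns that do not fill a patch
    patches = []
    for r in range(0, rows, patch):
        for c in range(0, cols, patch):
            patches.append(
                [image[r + i][c + j] for i in range(patch) for j in range(patch)]
            )
    return patches


# Assumed demo input: a 1 kHz sine tone sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(1024)]

spec = log_spectrogram(tone)  # the "image": time frames x frequency bins
patches = to_patches(spec)    # the token sequence a ViT-style encoder would consume
```

Each flattened patch plays the role that an image patch plays in ViT: it would be linearly projected to an embedding and passed through the transformer encoder. That projection and the encoder are omitted here.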