Update README.md
Browse files
README.md
CHANGED
@@ -3,19 +3,20 @@
|
|
3 |
<!-- Provide a quick summary of what the model is/does. [Optional] -->
|
4 |
This is an end-to-end Voice Assistant Model which can handle speech and text as inputs. It is trained using distillation loss. More details will be in a paper [COMING SOON]!
|
5 |
|
6 |
-
See
|
7 |
|
8 |
## Citation
|
9 |
No Publication As of Yet, But If You Use Please Cite the Below
|
10 |
**BibTeX:**
|
11 |
|
12 |
```
|
13 |
-
@
|
14 |
-
|
15 |
-
|
16 |
-
|
17 |
-
|
18 |
-
}
|
|
|
19 |
```
|
20 |
|
21 |
## Table of Contents
|
|
|
3 |
<!-- Provide a quick summary of what the model is/does. [Optional] -->
|
4 |
This is an end-to-end Voice Assistant Model which can handle speech and text as inputs. It is trained using distillation loss. More details will be in a paper [COMING SOON]!
|
5 |
|
6 |
+
See the model in action compared to SALMONN and Qwen-Audio at [value-nlp.github.io/DiVA-Demo](value-nlp.github.io/DiVA-Demo).
|
7 |
|
8 |
## Citation
|
9 |
No Publication As of Yet, But If You Use Please Cite the Below
|
10 |
**BibTeX:**
|
11 |
|
12 |
```
|
13 |
+
@misc{held2024diva,
|
14 |
+
author="Held, Will and Zhang, Yanzhe and Ryan, Michael and Shi, Weiyan and Li, Ella and Yang, Diyi",
|
15 |
+
title="Distilling an End-to-End Voice Assistant from Speech Recognition Data",
|
16 |
+
year="2024",
|
17 |
+
publisher="HuggingFace",
|
18 |
+
}
|
19 |
+
|
20 |
```
|
21 |
|
22 |
## Table of Contents
|