Feature Extraction
Transformers
Safetensors
diva
custom_code
WillHeld commited on
Commit
72eaa0a
·
verified ·
1 Parent(s): d5ca5fa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -7
README.md CHANGED
@@ -3,19 +3,20 @@
3
  <!-- Provide a quick summary of what the model is/does. [Optional] -->
4
  This is an end-to-end Voice Assistant Model which can handle speech and text as inputs. It is trained using distillation loss. More details will be in a paper [COMING SOON]!
5
 
6
- See also [value-nlp.github.io/DiVA-Demo](value-nlp.github.io/DiVA-Demo).
7
 
8
  ## Citation
9
  No Publication As of Yet, But If You Use Please Cite the Below
10
  **BibTeX:**
11
 
12
  ```
13
- @InProceedings{hewitt2023backpack,
14
- author = "Held, Will and Zhang, Yanzhe and Ryan, Michael and Shi, Weiyan and Li, Ella and Yang, Diyi",
15
- title = "Distilling an End-to-End Voice Assistant from Speech Recognition Data",
16
- year = "2024",
17
- publisher = "HuggingFace",
18
- }
 
19
  ```
20
 
21
  ## Table of Contents
 
3
  <!-- Provide a quick summary of what the model is/does. [Optional] -->
4
  This is an end-to-end Voice Assistant Model which can handle speech and text as inputs. It is trained using distillation loss. More details will be in a paper [COMING SOON]!
5
 
6
+ See the model in action compared to SALMONN and Qwen-Audio at [value-nlp.github.io/DiVA-Demo](value-nlp.github.io/DiVA-Demo).
7
 
8
  ## Citation
9
  No Publication As of Yet, But If You Use Please Cite the Below
10
  **BibTeX:**
11
 
12
  ```
13
+ @misc{held2024diva,
14
+ author="Held, Will and Zhang, Yanzhe and Ryan, Michael and Shi, Weiyan and Li, Ella and Yang, Diyi",
15
+ title="Distilling an End-to-End Voice Assistant from Speech Recognition Data",
16
+ year="2024",
17
+ publisher="HuggingFace",
18
+ }
19
+
20
  ```
21
 
22
  ## Table of Contents