Eniola Alese committed
Commit 132bd7c · 1 Parent(s): 6b93710
update app files
README.md
CHANGED
@@ -7,6 +7,10 @@ sdk: gradio
 sdk_version: 3.48.0
 app_file: app.py
 pinned: false
+models:
+- coqui/XTTS-v2
+- Systran/faster-whisper-large-v3
+- TheBloke/Mistral-7B-Instruct-v0.1-GGUF
 ---
 
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
app.py
CHANGED
@@ -168,10 +168,10 @@ with gr.Blocks(title="Voice chat with LLM") as demo:
 
 FOOTNOTE = """
 This Space demonstrates how to speak to an llm chatbot, based solely on open accessible models.
-It relies on following models :
-- Speech to Text : [Faster-Whisper](https://
--
-- Text to Speech : [
+It relies on the following models :
+- Speech to Text Model: [Faster-Whisper-large-v3](https://huggingface.co/Systran/faster-whisper-large-v3) an ASR model, to transcribe recorded audio to text.
+- Large Language Model: [Mistral-7b-instruct-v0.1-quantized](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF) a LLM to generate the chatbot responses.
+- Text to Speech Model: [XTTS-v2](https://huggingface.co/spaces/coqui/xtts) a TTS model, to generate the voice of the chatbot.
 
 Note:
 - Responses generated by chat model should not be assumed correct or taken serious, as this is a demonstration example only
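
The footnote above describes three stages chained in order: speech to text, then the LLM, then text to speech. The sketch below shows one possible way to wire such a chain. It is not taken from app.py, whose wiring is not visible in this diff. The class `VoiceChatPipeline`, its field names, and the stub stages are all hypothetical; in the real Space each callable would presumably wrap one of the three listed models.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

# Hypothetical stage signatures (assumptions, not the app.py API):
#   transcribe: recorded audio bytes -> transcript text
#   generate:   transcript text      -> chatbot reply text
#   synthesize: reply text           -> spoken audio bytes


@dataclass
class VoiceChatPipeline:
    transcribe: Callable[[bytes], str]
    generate: Callable[[str], str]
    synthesize: Callable[[str], bytes]

    def respond(self, audio: bytes) -> Tuple[str, str, bytes]:
        """Run one voice turn: audio in, (transcript, reply, speech) out."""
        transcript = self.transcribe(audio)
        reply = self.generate(transcript)
        speech = self.synthesize(reply)
        return transcript, reply, speech


# Stub stages so the sketch runs without downloading any model.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_generate(prompt: str) -> str:
    return "echo: " + prompt


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoiceChatPipeline(fake_transcribe, fake_generate, fake_synthesize)
```

Passing the stages in as plain callables keeps the chain testable with stubs, and any one model can be swapped without touching the other two.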