Spaces: eniolaa/voice-chat-with-llm
Eniola Alese committed on Apr 19, 2024

Commit 132bd7c, 1 parent: 6b93710

update app files

Files changed (2)

README.md CHANGED

@@ -7,6 +7,10 @@ sdk: gradio
 sdk_version: 3.48.0
 app_file: app.py
 pinned: false
+models:
+  - coqui/XTTS-v2
+  - Systran/faster-whisper-large-v3
+  - TheBloke/Mistral-7B-Instruct-v0.1-GGUF
 ---
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
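
The `models:` key added in this hunk lists Hub model IDs in the README's YAML front matter. As a rough illustration of how that list could be read back, here is a minimal sketch. `front_matter_models` is a hypothetical helper, not part of this Space, and it is a hand-rolled parser that assumes exactly the simple list shape shown in the diff; a real YAML parser would be more robust.

```python
from typing import List


def front_matter_models(readme_text: str) -> List[str]:
    """Return the entries under `models:` in a README's front matter.

    Hypothetical helper: handles only a flat `- item` list directly
    under a top-level `models:` key, as in the diff above.
    """
    lines = readme_text.splitlines()
    # Front matter must open with a `---` line.
    if not lines or lines[0].strip() != "---":
        return []
    models: List[str] = []
    in_models = False
    for line in lines[1:]:
        if line.strip() == "---":  # closing delimiter ends the front matter
            break
        if line.startswith("models:"):
            in_models = True
            continue
        if in_models:
            stripped = line.strip()
            if stripped.startswith("- "):
                models.append(stripped[2:].strip())
            else:
                in_models = False  # a new key ends the list
    return models


# Sample built only from keys visible in the diff.
SAMPLE_README = "\n".join([
    "---",
    "sdk: gradio",
    "sdk_version: 3.48.0",
    "app_file: app.py",
    "pinned: false",
    "models:",
    "  - coqui/XTTS-v2",
    "  - Systran/faster-whisper-large-v3",
    "  - TheBloke/Mistral-7B-Instruct-v0.1-GGUF",
    "---",
    "Check out the configuration reference",
])

print(front_matter_models(SAMPLE_README))
```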

app.py CHANGED

@@ -168,10 +168,10 @@ with gr.Blocks(title="Voice chat with LLM") as demo:
     FOOTNOTE = """
             This Space demonstrates how to speak to an llm chatbot, based solely on open accessible models.
-            It relies on following models :
-            - Speech to Text : [Faster-Whisper](https://github.com/SYSTRAN/faster-whisper/) an ASR model, to transcribe recorded audio to text.
-            - LLM Mistral    : [Mistral-7b-instruct](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) as the chatbot model.
-            - Text to Speech : [Coqui's XTTS V2](https://huggingface.co/spaces/coqui/xtts) as a Multilingual TTS model, to generate the voice of the chatbot.
+            It relies on the following models :
+            - Speech to Text Model: [Faster-Whisper-large-v3](https://huggingface.co/Systran/faster-whisper-large-v3) an ASR model, to transcribe recorded audio to text.
+            - Large Language Model: [Mistral-7b-instruct-v0.1-quantized](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF) a LLM to generate the chatbot responses.
+            - Text to Speech Model: [XTTS-v2](https://huggingface.co/spaces/coqui/xtts) a TTS model, to generate the voice of the chatbot.
             Note:
             - Responses generated by chat model should not be assumed correct or taken serious, as this is a demonstration example only
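
The FOOTNOTE describes a three-stage flow: speech to text, then an LLM reply, then text to speech. The sketch below shows one way such a turn could be wired together. It is not the code from `app.py`, which this diff does not show. `VoiceChatPipeline`, the stage signatures, and the `fake_*` stand-ins are all assumptions made for illustration; a real setup would plug in the ASR, LLM, and TTS models named above.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Hypothetical stage signatures (assumptions, not taken from app.py).
Transcriber = Callable[[bytes], str]                      # audio -> user text
ChatModel = Callable[[List[Tuple[str, str]], str], str]   # (history, user text) -> reply
Synthesizer = Callable[[str], bytes]                      # reply text -> audio


@dataclass
class VoiceChatPipeline:
    transcribe: Transcriber
    generate: ChatModel
    synthesize: Synthesizer

    def turn(
        self, audio: bytes, history: List[Tuple[str, str]]
    ) -> Tuple[str, str, bytes]:
        """Run one voice turn and append it to the chat history."""
        user_text = self.transcribe(audio)               # speech to text
        reply_text = self.generate(history, user_text)   # LLM response
        history.append((user_text, reply_text))
        reply_audio = self.synthesize(reply_text)        # text to speech
        return user_text, reply_text, reply_audio


# Stand-in stages so the sketch runs without any model downloads.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_generate(history: List[Tuple[str, str]], user_text: str) -> str:
    return f"echo: {user_text}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoiceChatPipeline(fake_transcribe, fake_generate, fake_synthesize)
history: List[Tuple[str, str]] = []
print(pipeline.turn(b"hello", history))
```

Keeping each stage behind a plain callable means any one model can be swapped without touching the turn logic.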