Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -8,5 +8,38 @@ sdk_version: 5.18.0
|
|
8 |
app_file: app.py
|
9 |
pinned: false
|
10 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
11 |
|
12 |
-
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
|
|
8 |
app_file: app.py
|
9 |
pinned: false
|
10 |
---
|
11 |
+
Here's a refined version of your text with improved clarity and flow:
|
12 |
+
|
13 |
+
---
|
14 |
+
Here's a refined version of your text with improved clarity and flow:
|
15 |
+
|
16 |
+
---
|
17 |
+
Here's a refined version of your text with improved clarity and flow:
|
18 |
+
|
19 |
+
---
|
20 |
+
|
21 |
+
### **Storytelling Text-to-Speech (TTS) Application**
|
22 |
+
|
23 |
+
#### **Project Objectives**
|
24 |
+
This project aims to develop an interactive Text-to-Speech (TTS) application that enables users to input stories and have them narrated using diverse voices in multiple languages.
|
25 |
+
|
26 |
+
#### **Implemented Pipelines**
|
27 |
+
The application leverages the Edge TTS service for high-quality voice synthesis, offering a range of neural voices across different languages. Key functionalities include:
|
28 |
+
|
29 |
+
- **Text Input:** Users can enter the story they wish to convert into speech.
|
30 |
+
- **Language Selection:** A dropdown menu allows users to choose from various languages, including English and Arabic.
|
31 |
+
- **Speaker Selection:** Based on the selected language, users can pick from a list of available speakers.
|
32 |
+
- **Audio Generation:** Clicking the “Generate Magical Audio” button processes the text and produces an audio file.
|
33 |
+
- **Audio Playback:** The generated audio file is displayed, allowing users to listen to the narration.
|
34 |
+
|
35 |
+
#### **How to Use the Interface**
|
36 |
+
1. **Enter Your Story:** Type your text into the provided input field.
|
37 |
+
2. **Select Language:** Choose your preferred language from the dropdown menu.
|
38 |
+
3. **Pick a Speaker:** Select a speaker corresponding to the chosen language.
|
39 |
+
4. **Generate Audio:** Click the "Generate Magical Audio" button to create the narration.
|
40 |
+
5. **Listen to the Output:** Once generated, the audio file will be available for playback.
|
41 |
+
|
42 |
+
#### **Justification for Model and Pipeline Choices**
|
43 |
+
- **Edge TTS Service:** Chosen for its advanced neural voice synthesis, Edge TTS delivers natural-sounding speech and supports multiple languages, enhancing the storytelling experience.
|
44 |
+
- **User-Friendly Interface:** The application is built using Gradio, ensuring an intuitive and interactive user experience for seamless text-to-speech conversion.
|
45 |
|
|