---
title: Audio Stories
emoji: 🌍
colorFrom: yellow
colorTo: gray
sdk: gradio
sdk_version: 5.18.0
app_file: app.py
pinned: false
---

### **Storytelling Text-to-Speech (TTS) Application**  

#### **Project Objectives**  
This project is an interactive Text-to-Speech (TTS) application: users enter a story and hear it narrated in a voice and language of their choice.  

#### **Implemented Pipelines**  
The application uses the Edge TTS service for voice synthesis, which offers neural voices across many languages. Key functionalities include:  

- **Text Input:** Users can enter the story they wish to convert into speech.  
- **Language Selection:** A dropdown menu allows users to choose from various languages, including English and Arabic.  
- **Speaker Selection:** Based on the selected language, users can pick from a list of available speakers.  
- **Audio Generation:** Clicking the "Generate Magical Audio" button processes the text and produces an audio file.  
- **Audio Playback:** The generated audio file is displayed, allowing users to listen to the narration.  
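
The pipeline above can be sketched roughly as follows. This is a minimal illustration, not the code in `app.py`: the `VOICES` table, the speaker labels, and the helper names `speakers_for`, `resolve_voice`, and `narrate` are assumptions made for the example. It relies on the `edge-tts` Python package (`edge_tts.Communicate(...).save(...)`).

```python
# Illustrative voice table. The short names are Edge TTS neural voices, but
# which speakers this app actually offers is an assumption.
VOICES = {
    "English": {"Aria": "en-US-AriaNeural", "Guy": "en-US-GuyNeural"},
    "Arabic": {"Hamed": "ar-SA-HamedNeural", "Zariyah": "ar-SA-ZariyahNeural"},
}


def speakers_for(language: str) -> list[str]:
    """Return the speaker labels offered for a language."""
    return list(VOICES[language])


def resolve_voice(language: str, speaker: str) -> str:
    """Map a (language, speaker) pair to an Edge TTS voice short name."""
    try:
        return VOICES[language][speaker]
    except KeyError:
        raise ValueError(
            f"No speaker {speaker!r} for language {language!r}"
        ) from None


async def narrate(text: str, language: str, speaker: str,
                  out_path: str = "story.mp3") -> str:
    """Synthesize `text` to an MP3 file and return the file path."""
    if not text.strip():
        raise ValueError("Story text is empty")
    # Imported lazily so the helpers above work without edge-tts installed.
    import edge_tts

    communicate = edge_tts.Communicate(text, resolve_voice(language, speaker))
    await communicate.save(out_path)  # needs network access
    return out_path
```

With `edge-tts` installed and network access available, `asyncio.run(narrate("Once upon a time...", "English", "Aria"))` would write `story.mp3` and return its path.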

#### **How to Use the Interface**  
1. **Enter Your Story:** Type your text into the provided input field.  
2. **Select Language:** Choose your preferred language from the dropdown menu.  
3. **Pick a Speaker:** Select a speaker corresponding to the chosen language.  
4. **Generate Audio:** Click the "Generate Magical Audio" button to create the narration.  
5. **Listen to the Output:** Once generated, the audio file will be available for playback.  
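
For readers curious how these steps map onto Gradio components, here is a hedged sketch of one possible wiring. It is not the actual `app.py`: the `VOICES` table, the component labels other than the button text, and the names `speaker_choices` and `build_demo` are assumptions for illustration.

```python
import tempfile

# Illustrative table (an assumption): language -> {speaker label: voice name}.
VOICES = {
    "English": {"Aria": "en-US-AriaNeural", "Guy": "en-US-GuyNeural"},
    "Arabic": {"Hamed": "ar-SA-HamedNeural", "Zariyah": "ar-SA-ZariyahNeural"},
}


def speaker_choices(language):
    """Return (choices, default) for the speaker dropdown of a language."""
    names = list(VOICES.get(language, {}))
    return names, (names[0] if names else None)


def build_demo():
    """Build the Gradio Blocks UI (requires gradio and edge-tts)."""
    import edge_tts
    import gradio as gr

    def update_speakers(language):
        # Step 3: refresh the speaker list when the language changes.
        choices, default = speaker_choices(language)
        return gr.Dropdown(choices=choices, value=default)

    async def generate(story, language, speaker):
        # Step 4: synthesize the story and return the audio file path.
        if not story.strip():
            raise gr.Error("Please enter a story first.")
        with tempfile.NamedTemporaryFile(suffix=".mp3", delete=False) as tmp:
            path = tmp.name
        await edge_tts.Communicate(story, VOICES[language][speaker]).save(path)
        return path

    with gr.Blocks(title="Audio Stories") as demo:
        story = gr.Textbox(label="Your story", lines=6)  # step 1
        language = gr.Dropdown(choices=list(VOICES), value="English",
                               label="Language")  # step 2
        choices, default = speaker_choices("English")
        speaker = gr.Dropdown(choices=choices, value=default, label="Speaker")
        button = gr.Button("Generate Magical Audio")
        audio = gr.Audio(label="Narration", type="filepath")  # step 5

        language.change(update_speakers, inputs=language, outputs=speaker)
        button.click(generate, inputs=[story, language, speaker],
                     outputs=audio)
    return demo
```

Calling `build_demo().launch()` would start the interface locally. Keeping `speaker_choices` free of Gradio imports makes the language-to-speaker logic easy to check on its own.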

#### **Justification for Model and Pipeline Choices**  
- **Edge TTS Service:** Chosen for its neural voice synthesis, which produces natural-sounding speech, and for its support of multiple languages, both of which matter for narrating stories.  
- **User-Friendly Interface:** The application is built with Gradio, which supplies the text input, dropdowns, button, and audio player, so users can convert text to speech without any setup.