Space: Vinay15/Text-to-Speech_Model_for_English_Technical_Speech


Vinay15 committed on Oct 28, 2024

Commit bba37d1 (verified) · 1 parent: c406abb

Update app.py

Files changed (1)

app.py +8 -10

@@ -61,7 +61,7 @@ def text_to_speech(input_text):
     return output_file
-# Step 3: Create Gradio interface with examples, model description, and processing time note
 iface = gr.Interface(
     fn=text_to_speech,
     inputs="text",
@@ -69,21 +69,19 @@ iface = gr.Interface(
     title="Fine-tuning TTS for Technical Vocabulary",
     description="""
         Enter text containing technical terms or abbreviations for text-to-speech conversion. The model has been fine-tuned with a dataset specifically prepared to handle technical vocabulary and acronyms. This includes a pronunciation dictionary for terms such as API, CUDA, and OAuth. Sentence segmentation and custom pronunciation handling further optimize the output for natural, intelligible speech.
-        **Sample Examples:**
-        - "The API allows integration with OAuth and REST for scalable web services."
-        - "TensorFlow provides comprehensive tools for deep learning across various platforms."
-        - "What are continuous integration systems, and what is their role in the automated-build process?"
-        **Note:** Processing time may vary based on sentence length. Longer sentences may take additional time to generate speech. Additionally, the model’s performance improves as more technical terms are added to the pronunciation dictionary, enhancing accuracy for specialized vocabulary.
     """,
     examples=[
         ["What are continuous integration systems, and what is their role in the automated-build process?"],
         ["Using CUDA for deep learning optimizes the model training on GPU."],
         ["In TTS models, the vocoder is essential for natural-sounding speech."],
-        ["What is GPU?"]
     ]
 )
 # Step 4: Launch the app
-iface.launch(share=True)

     return output_file
+# Step 3: Create Gradio interface without sample examples
 iface = gr.Interface(
     fn=text_to_speech,
     inputs="text",
     title="Fine-tuning TTS for Technical Vocabulary",
     description="""
         Enter text containing technical terms or abbreviations for text-to-speech conversion. The model has been fine-tuned with a dataset specifically prepared to handle technical vocabulary and acronyms. This includes a pronunciation dictionary for terms such as API, CUDA, and OAuth. Sentence segmentation and custom pronunciation handling further optimize the output for natural, intelligible speech.
+        Note: Processing time may vary based on sentence length. Longer sentences may take additional time to generate speech. Additionally, the model’s performance improves as more technical terms are added to the pronunciation dictionary, enhancing accuracy for specialized vocabulary.
+        GitHub Repository: [Text-to-Speech Model for English Technical Speech](https://github.com/Vinay152003/Text-to-Speech_Model_for_English_Technical_Speech-Using-SpeechT5)
+        Report: [Project Report](https://drive.google.com/file/d/1CfnpeUi18R7De1uhilYuhMYLS_xXjh2Q/view)
     """,
     examples=[
+        ["What is GPU?"],
         ["What are continuous integration systems, and what is their role in the automated-build process?"],
         ["Using CUDA for deep learning optimizes the model training on GPU."],
         ["In TTS models, the vocoder is essential for natural-sounding speech."],
+        ["TensorFlow provides comprehensive tools for deep learning."],
+        ["The API allows integration with OAuth and REST for scalable web services."]
     ]
 )
 # Step 4: Launch the app
+iface.launch(share=True)
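
Note on the interface description: it mentions a pronunciation dictionary and sentence segmentation, but this diff does not show how `text_to_speech` implements them. The following is only a minimal sketch of one way such preprocessing could look, not the author's code. The dictionary entries, the respellings, and the helper names (`PRONUNCIATIONS`, `apply_pronunciations`, `split_sentences`, `preprocess`) are all hypothetical.

```python
import re
from typing import List

# Hypothetical pronunciation dictionary. The terms come from the app's
# description; the respellings are illustrative guesses, not the author's.
PRONUNCIATIONS = {
    "API": "A P I",
    "CUDA": "koo-dah",
    "OAuth": "oh-auth",
    "GPU": "G P U",
}

# Match whole words only, trying longer terms first so a short key
# cannot shadow a longer one that starts with the same letters.
_TERM_PATTERN = re.compile(
    r"\b("
    + "|".join(re.escape(t) for t in sorted(PRONUNCIATIONS, key=len, reverse=True))
    + r")\b"
)


def apply_pronunciations(text: str) -> str:
    """Replace each known term with its respelling (case-sensitive)."""
    return _TERM_PATTERN.sub(lambda m: PRONUNCIATIONS[m.group(1)], text)


def split_sentences(text: str) -> List[str]:
    """Naive segmentation: split after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def preprocess(text: str) -> List[str]:
    """Segment the input, then respell technical terms in each sentence."""
    return [apply_pronunciations(s) for s in split_sentences(text)]


if __name__ == "__main__":
    for sentence in preprocess("What is GPU? Using CUDA for deep learning."):
        print(sentence)
```

Under this sketch, each returned sentence would be synthesized separately and the audio concatenated, which is one plausible reason the description warns that longer inputs take more time. The regex-based splitter is deliberately simple and will mis-split on abbreviations such as "e.g."; a real implementation might use a dedicated sentence tokenizer instead.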