Vinay15 commited on
Commit
bba37d1
·
verified ·
1 Parent(s): c406abb

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +8 -10
app.py CHANGED
@@ -61,7 +61,7 @@ def text_to_speech(input_text):
61
 
62
  return output_file
63
 
64
- # Step 3: Create Gradio interface with examples, model description, and processing time note
65
  iface = gr.Interface(
66
  fn=text_to_speech,
67
  inputs="text",
@@ -69,21 +69,19 @@ iface = gr.Interface(
69
  title="Fine-tuning TTS for Technical Vocabulary",
70
  description="""
71
  Enter text containing technical terms or abbreviations for text-to-speech conversion. The model has been fine-tuned with a dataset specifically prepared to handle technical vocabulary and acronyms. This includes a pronunciation dictionary for terms such as API, CUDA, and OAuth. Sentence segmentation and custom pronunciation handling further optimize the output for natural, intelligible speech.
72
-
73
- **Sample Examples:**
74
- - "The API allows integration with OAuth and REST for scalable web services."
75
- - "TensorFlow provides comprehensive tools for deep learning across various platforms."
76
- - "What are continuous integration systems, and what is their role in the automated-build process?"
77
-
78
- **Note:** Processing time may vary based on sentence length. Longer sentences may take additional time to generate speech. Additionally, the model’s performance improves as more technical terms are added to the pronunciation dictionary, enhancing accuracy for specialized vocabulary.
79
  """,
80
  examples=[
 
81
  ["What are continuous integration systems, and what is their role in the automated-build process?"],
82
  ["Using CUDA for deep learning optimizes the model training on GPU."],
83
  ["In TTS models, the vocoder is essential for natural-sounding speech."],
84
- ["What is GPU?"]
 
85
  ]
86
  )
87
 
88
  # Step 4: Launch the app
89
- iface.launch(share=True)
 
61
 
62
  return output_file
63
 
64
+ # Step 3: Create Gradio interface without sample examples
65
  iface = gr.Interface(
66
  fn=text_to_speech,
67
  inputs="text",
 
69
  title="Fine-tuning TTS for Technical Vocabulary",
70
  description="""
71
  Enter text containing technical terms or abbreviations for text-to-speech conversion. The model has been fine-tuned with a dataset specifically prepared to handle technical vocabulary and acronyms. This includes a pronunciation dictionary for terms such as API, CUDA, and OAuth. Sentence segmentation and custom pronunciation handling further optimize the output for natural, intelligible speech.
72
+ Note: Processing time may vary based on sentence length. Longer sentences may take additional time to generate speech. Additionally, the model’s performance improves as more technical terms are added to the pronunciation dictionary, enhancing accuracy for specialized vocabulary.
73
+ GitHub Repository: [Text-to-Speech Model for English Technical Speech](https://github.com/Vinay152003/Text-to-Speech_Model_for_English_Technical_Speech-Using-SpeechT5)
74
+ Report: [Project Report](https://drive.google.com/file/d/1CfnpeUi18R7De1uhilYuhMYLS_xXjh2Q/view)
 
 
 
 
75
  """,
76
  examples=[
77
+ ["What is GPU?"],
78
  ["What are continuous integration systems, and what is their role in the automated-build process?"],
79
  ["Using CUDA for deep learning optimizes the model training on GPU."],
80
  ["In TTS models, the vocoder is essential for natural-sounding speech."],
81
+ ["TensorFlow provides comprehensive tools for deep learning."],
82
+ ["The API allows integration with OAuth and REST for scalable web services."]
83
  ]
84
  )
85
 
86
  # Step 4: Launch the app
87
+ iface.launch(share=True)