--- license: mit library_name: safetensors tags: - text-to-speech - tts - hindi - speech-synthesis - code datasets: - SPRINGLab/IndicVoices-R_Hindi language: - hi model_type: F5-TTS base_model: - SWivid/F5-TTS --- # Hindi TTS (Text-to-Speech, 24kHz) ## Overview Hindi TTS is a high-quality Text-to-Speech model developed using the F5 TTS architecture. Built by FuturixAI and Quantum Works, this model enables natural-sounding Hindi speech synthesis and is distributed under the MIT license. It is intended for both research and commercial applications. ## Key Features - **Language:** Hindi - **Sampling Rate:** 24 kHz ## Training Data The model was trained on the **IndicVoices-R_Hindi** dataset provided by IIT Madras. - Dataset Link: [https://huggingface.co/datasets/SPRINGLab/IndicVoices-R_Hindi](https://huggingface.co/datasets/SPRINGLab/IndicVoices-R_Hindi) ## Usage Instructions ### Prerequisites Ensure you have installed the necessary dependencies for the `f5-tts_infer-cli`. Refer to the GitHub repository for installation instructions: [https://github.com/rumourscape/F5-TTS](https://github.com/rumourscape/F5-TTS) ### Example Usage ```bash f5-tts_infer-cli \ --model "Futurix-AI/Hindi-TTS" \ --ref_audio "ref_audio.wav" \ --ref_text "यह संदर्भ ऑडियो का सामग्री, उपशीर्षक या लिप्यंतरण है।" \ --gen_text "यह एक उदाहरण है जो मॉडल से बोलने के लिए उत्पन्न किया गया है।" ``` #### Parameters: - **`--model`**: Replace "hindi_tts_checkpoint.pth" with the actual checkpoint file name. - **`--ref_audio`**: Path to the reference audio file (e.g., "ref_audio.wav"). - **`--ref_text`**: Hindi text corresponding to the reference audio. - **`--gen_text`**: Hindi text for the TTS model to generate speech. --- license: mit ---