---
license: cc-by-nc-sa-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- bond005/sberdevices_golos_10h_crowd
- bond005/sberdevices_golos_100h_farfield
- bond005/sova_rudevices
- Aniemore/resd_annotated
language:
- ru
- en
base_model:
- SWivid/F5-TTS
---

## Overview

This is the F5-TTS model fine-tuned for Russian and English speech synthesis.

## License

This model is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 license, which allows free use, modification, and distribution for non-commercial purposes, provided that attribution is given and derivatives are shared under the same license.

## Model Information

**Base Model:** SWivid/F5-TTS

**Training Duration:** 813k steps

**Dataset Duration:** 100k hours

## Train charts

![Training loss](loss.svg)

![Learning rate](lr.svg)

## Training Configuration

```json
{
    "exp_name": "F5TTS_Base",
    "learning_rate": 1e-05,
    "batch_size_per_gpu": 5000,
    "batch_size_type": "frame",
    "max_samples": 64,
    "grad_accumulation_steps": 1,
    "max_grad_norm": 1,
    "epochs": 1,
    "num_warmup_updates": 405764,
    "save_per_updates": 811528,
    "keep_last_n_checkpoints": 5,
    "last_per_updates": 10000,
    "finetune": true,
    "file_checkpoint_train": "",
    "tokenizer_type": "char",
    "tokenizer_file": "",
    "mixed_precision": "fp16",
    "logger": "wandb",
    "bnb_optimizer": true
}
```

## Usage Instructions

See the [base repo](https://github.com/SWivid/F5-TTS) for installation and inference instructions.

## To do

- Ask in the community tab

## Other links

- [GitHub repo](https://github.com/HotDro4illa/F5-TTS)
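
## Warmup sketch

To make the `learning_rate` and `num_warmup_updates` values in the training configuration more concrete, here is a minimal sketch of a linear warmup. It assumes the learning rate ramps linearly from 0 to the configured peak. This card does not state the schedule shape or what happens after warmup, so the helper `warmup_lr` is hypothetical and simply holds the peak value once warmup ends. Note that `num_warmup_updates` is exactly half of `save_per_updates` (405764 vs. 811528).

```python
# Values copied from the training configuration in this card.
PEAK_LR = 1e-05
NUM_WARMUP_UPDATES = 405_764


def warmup_lr(step: int,
              peak_lr: float = PEAK_LR,
              warmup_updates: int = NUM_WARMUP_UPDATES) -> float:
    """Hypothetical helper: linear ramp from 0 to peak_lr.

    Assumption: the post-warmup behaviour is not described in this card,
    so the value is held at peak_lr after warmup_updates steps.
    """
    if step < 0:
        raise ValueError("step must be non-negative")
    return peak_lr * min(step, warmup_updates) / warmup_updates


if __name__ == "__main__":
    # Print the assumed learning rate at a few points of the run.
    for step in (0, 100_000, 202_882, 405_764, 811_528):
        print(f"step {step:>7}: lr = {warmup_lr(step):.3e}")
```

Under this assumption, the learning rate reaches its configured peak only at update 405764. Compare against the learning rate chart above to see the schedule actually used.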