metadata
license: cc-by-nc-sa-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- bond005/sberdevices_golos_10h_crowd
- bond005/sberdevices_golos_100h_farfield
- bond005/sova_rudevices
- Aniemore/resd_annotated
language:
- ru
- en
base_model:
- SWivid/F5-TTS
Overview
The F5-TTS model is finetuned for Russian and English language
License
This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution
Model Information
Base Model: SWivid/F5-TTS
Training Duration: 813k steps
Dataset Duration: 100k hours
Train charts
Training Configuration:
{
"exp_name": "F5TTS_Base",
"learning_rate": 1e-05,
"batch_size_per_gpu": 5000,
"batch_size_type": "frame",
"max_samples": 64,
"grad_accumulation_steps": 1,
"max_grad_norm": 1,
"epochs": 1,
"num_warmup_updates": 405764,
"save_per_updates": 811528,
"keep_last_n_checkpoints": 5,
"last_per_updates": 10000,
"finetune": true,
"file_checkpoint_train": "",
"tokenizer_type": "char",
"tokenizer_file": "",
"mixed_precision": "fp16",
"logger": "wandb",
"bnb_optimizer": true
}
Usage Instructions
Go to base repo
To do
- Ask in community tab