F5-TTS-Russian / README.md
hotstone228's picture
Update README.md
4b9fcb5 verified
metadata
license: cc-by-nc-sa-4.0
datasets:
  - mozilla-foundation/common_voice_17_0
  - bond005/sberdevices_golos_10h_crowd
  - bond005/sberdevices_golos_100h_farfield
  - bond005/sova_rudevices
  - Aniemore/resd_annotated
language:
  - ru
  - en
base_model:
  - SWivid/F5-TTS

Overview

The F5-TTS model is finetuned for Russian and English language

License

This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution

Model Information

Base Model: SWivid/F5-TTS
Training Duration: 813k steps
Dataset Duration: 100k hours

Train charts

alt text alt text

Training Configuration:

{
    "exp_name": "F5TTS_Base",
    "learning_rate": 1e-05,
    "batch_size_per_gpu": 5000,
    "batch_size_type": "frame",
    "max_samples": 64,
    "grad_accumulation_steps": 1,
    "max_grad_norm": 1,
    "epochs": 1,
    "num_warmup_updates": 405764,
    "save_per_updates": 811528,
    "keep_last_n_checkpoints": 5,
    "last_per_updates": 10000,
    "finetune": true,
    "file_checkpoint_train": "",
    "tokenizer_type": "char",
    "tokenizer_file": "",
    "mixed_precision": "fp16",
    "logger": "wandb",
    "bnb_optimizer": true
}

Usage Instructions

Go to base repo

To do

  • Ask in community tab

Other links