---
license: cc-by-nc-4.0
datasets:
- mesolitica/Malaysian-Emilia
language:
- ms
- en
base_model:
- SWivid/F5-TTS
---
# Full Parameter Finetuning Malaysian Emilia F5-TTS Speech Enhancement
Continued training from [SWivid/F5-TTS](https://huggingface.co/SWivid/F5-TTS) on the speech-enhanced [Malaysian-Emilia-annotated](https://huggingface.co/datasets/mesolitica/Malaysian-Emilia-annotated) dataset.
**This model should be able to perform zero-shot voice conversion for any Malaysian or Singaporean speaker.**
WandB logs are at https://wandb.ai/huseinzol05/CFM-TTS
## Checkpoints
We uploaded full checkpoints with optimizer states every 50k steps at [full-checkpoint](full-checkpoint).
If you find that the latest checkpoint is overfitting, we have also uploaded alternative checkpoints:
1. 150000 steps, [huseinzol05/Malaysian-F5-TTS-150000](https://huggingface.co/huseinzol05/Malaysian-F5-TTS-150000)
2. 200000 steps, [huseinzol05/Malaysian-F5-TTS-200000](https://huggingface.co/huseinzol05/Malaysian-F5-TTS-200000)
3. 250000 steps, [huseinzol05/Malaysian-F5-TTS-250000](https://huggingface.co/huseinzol05/Malaysian-F5-TTS-250000)
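
A minimal sketch of fetching one of the alternative checkpoints listed above, assuming the `huggingface_hub` package is installed. The helper names `checkpoint_repo_id` and `download_checkpoint` are hypothetical and only illustrate the repository naming pattern; the file layout inside each repository is not described here, so inspect the downloaded folder before loading anything.

```python
# Step counts of the alternative checkpoints listed in this README.
ALTERNATIVE_STEPS = (150000, 200000, 250000)


def checkpoint_repo_id(steps: int) -> str:
    """Map a step count to its Hugging Face repository id (hypothetical helper)."""
    if steps not in ALTERNATIVE_STEPS:
        raise ValueError(f"no alternative checkpoint listed for {steps} steps")
    return f"huseinzol05/Malaysian-F5-TTS-{steps}"


def download_checkpoint(steps: int) -> str:
    """Download the whole checkpoint repository and return its local path."""
    # Imported lazily so checkpoint_repo_id works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=checkpoint_repo_id(steps))


if __name__ == "__main__":
    # Requires network access; prints the local cache directory of the snapshot.
    print(download_checkpoint(150000))
```

If a later checkpoint sounds overfitted on your reference audio, calling `download_checkpoint` with a smaller step count lets you compare an earlier one.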
## Dataset
We train on a post-filtered version of [Malaysian-Emilia-annotated](https://huggingface.co/datasets/mesolitica/Malaysian-Emilia-annotated) called [Malaysian-Voice-Conversion-Speech-Enhancement](https://huggingface.co/datasets/mesolitica/Malaysian-Voice-Conversion-Speech-Enhancement).
## Source code
All source code is at https://github.com/mesolitica/malaya-speech/tree/master/session/f5-tts