Text-to-Speech
F5-TTS
Italian
alien79 commited on
Commit
5c7f270
1 Parent(s): 1242903

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -9
README.md CHANGED
@@ -11,15 +11,9 @@ license: cc-by-4.0
11
 
12
  This is a test to see how to finetune F5 in italian
13
 
14
- Trained over 9h split of facebook/multilingual_librispeech dataset for 200 Epoch:
15
  - catastrophic failure (the model forgot english)
16
- - lost ability to clone voice properly
17
- - italian pronunciation not yet good enough
18
 
19
- The last produced file, the one to test, is model_italian_200e_9h.safetensors
20
 
21
- The run.py file is an example of how to extract the wav files and produce the metadata.csv to use for training
22
-
23
- UPDATE:
24
-
25
- trying to finetune on the full Italian "train" split of the same dataset with 247 hours
 
11
 
12
  This is a test to see how to finetune F5 in italian
13
 
14
+ Trained over 247+h hours of "train" split of facebook/multilingual_librispeech dataset, 6700 steps for Epoch:
15
  - catastrophic failure (the model forgot english)
16
+ - italian pronunciation not perfect
 
17
 
 
18
 
19
+ The run.py file is an example of how to extract the wav files and produce the metadata.csv to use for training