hotstone228 commited on
Commit
4b9fcb5
·
verified ·
1 Parent(s): 2910a02

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +32 -23
README.md CHANGED
@@ -3,51 +3,60 @@ license: cc-by-nc-sa-4.0
3
  datasets:
4
  - mozilla-foundation/common_voice_17_0
5
  - bond005/sberdevices_golos_10h_crowd
 
6
  - bond005/sova_rudevices
7
  - Aniemore/resd_annotated
8
  language:
9
  - ru
 
10
  base_model:
11
  - SWivid/F5-TTS
12
  ---
13
  ## Overview
14
- The F5-TTS model is finetuned specifically for Russian language
15
 
16
  ## License
17
  This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution
18
 
19
  ## Model Information
20
  **Base Model**: SWivid/F5-TTS
21
- **Total Training Duration:** 250.000 steps
 
22
 
23
- **Training Configuration:**
 
 
 
 
24
  ```json
25
- "exp_name": "F5TTS_Base",
26
- "learning_rate": 1e-05,
27
- "batch_size_per_gpu": 4500,
28
- "batch_size_type": "frame",
29
- "max_samples": 64,
30
- "grad_accumulation_steps": 1,
31
- "max_grad_norm": 1,
32
- "epochs": 144,
33
- "num_warmup_updates": 5838,
34
- "save_per_updates": 11676,
35
- "last_per_steps": 2918,
36
- "finetune": true,
37
- "file_checkpoint_train": "",
38
- "tokenizer_type": "char",
39
- "tokenizer_file": "",
40
- "mixed_precision": "fp16",
41
- "logger": "wandb",
42
- "bnb_optimizer": true
 
 
 
43
  ```
44
 
45
  ## Usage Instructions
46
  Go to [base repo](https://github.com/SWivid/F5-TTS)
47
 
48
  ## To do
49
- - Correct stressmarks
50
- - English support
51
 
52
  # Other links
53
  - [Github repo](https://github.com/HotDro4illa/F5-TTS)
 
3
  datasets:
4
  - mozilla-foundation/common_voice_17_0
5
  - bond005/sberdevices_golos_10h_crowd
6
+ - bond005/sberdevices_golos_100h_farfield
7
  - bond005/sova_rudevices
8
  - Aniemore/resd_annotated
9
  language:
10
  - ru
11
+ - en
12
  base_model:
13
  - SWivid/F5-TTS
14
  ---
15
  ## Overview
16
+ The F5-TTS model is finetuned for Russian and English language
17
 
18
  ## License
19
  This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution
20
 
21
  ## Model Information
22
  **Base Model**: SWivid/F5-TTS
23
+ **Training Duration:** 813k steps
24
+ **Dataset Duration:** 100k hours
25
 
26
+ ## Train charts
27
+ ![alt text](loss.svg)
28
+ ![alt text](lr.svg)
29
+
30
+ ## Training Configuration:
31
  ```json
32
+ {
33
+ "exp_name": "F5TTS_Base",
34
+ "learning_rate": 1e-05,
35
+ "batch_size_per_gpu": 5000,
36
+ "batch_size_type": "frame",
37
+ "max_samples": 64,
38
+ "grad_accumulation_steps": 1,
39
+ "max_grad_norm": 1,
40
+ "epochs": 1,
41
+ "num_warmup_updates": 405764,
42
+ "save_per_updates": 811528,
43
+ "keep_last_n_checkpoints": 5,
44
+ "last_per_updates": 10000,
45
+ "finetune": true,
46
+ "file_checkpoint_train": "",
47
+ "tokenizer_type": "char",
48
+ "tokenizer_file": "",
49
+ "mixed_precision": "fp16",
50
+ "logger": "wandb",
51
+ "bnb_optimizer": true
52
+ }
53
  ```
54
 
55
  ## Usage Instructions
56
  Go to [base repo](https://github.com/SWivid/F5-TTS)
57
 
58
  ## To do
59
+ - Ask in community tab
 
60
 
61
  # Other links
62
  - [Github repo](https://github.com/HotDro4illa/F5-TTS)