File size: 1,686 Bytes
37cc488
 
 
2322a39
e89fe21
cc2d8ad
322cd2c
 
 
4ac7bb3
 
322cd2c
b4533f1
cc2d8ad
c8ca297
4ac7bb3
cc2d8ad
b4533f1
c8ca297
 
4ac7bb3
c8ca297
 
db25481
c8ca297
 
4ac7bb3
 
c8ca297
 
db25481
c8ca297
59b9c93
4ac7bb3
 
23e9296
a956b92
 
 
 
 
 
23e9296
 
 
4ac7bb3
 
 
490107f
4ac7bb3
 
 
9e598a2
4ac7bb3
55dd0b8
 
9e598a2
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
---
license: openrail
---

Test RVC models on the DDLC character Monika, via various hyperparams and datasets.

# monika-test-0 (~07/2023)
* Trained on augmented dataset of ~10 10 second clips
* Trained for ~100 epochs
* RVC1
* "Version 1" ("0" in the old numbering)
  
# monika-test-2 (~07/2023)
* Trained on augmented dataset of ~10 10 second clips (augmented via tortoise tts)
* Trained for 100 epochs
* RVC1
  
# monika-test-4 (~07/2023)
* Trained on smaller but better dataset of ~2 10 second clips (augmented via 11labs)
* Trained for 150 epochs
* RVC1

# monika-test-7 (08/22/2023)
* Trained on augmented dataset of ~10+ 10 second clips (augmented via tortoise tts)
* Trained for 60 epochs (720 steps)
* Better quality than others
* "Version 2" ("1" in old numbering)
* RVC2

# monika-test-8 (08/22/2023)
* Trained on smaller but better dataset of ~5 10 second clips (some augmented via 11labs)
* Trained for 60 epochs (660 steps)
* Even clearer quality but with slightly more artifacting than monika-test-7 (still better than pre 7th ones)
* "Version 2a" ("1a" in old numbering)
* RVC2

# ct-m3 (~10/2023)
* Trained on preprocessed version of dataset of ~5 10 second clips
* Trained for ~100 epochs
* Test model
* RVC1

# ct-m4 (~10/2023)
* Trained on preprocessed version of dataset of ~5 10 second clips
* Trained for ~200 epochs
* Test model
* RVC1

# ct-m4a (~10/2023)
* Trained on preprocessed version of dataset of ~5 10 second clips
* Trained for ~200 epochs
* "Version 4" ("3" in old numbering)
* RVC2

# fused2 (~02/2024)
* Merge between ct-m3 and another model ("Sayori"-based model, with ratio of 75% to 25%)
* Somewhat clearer quality
* Yet another test model
* RVC2