Update README.md
Browse files
README.md
CHANGED
@@ -70,12 +70,6 @@ There are two checkpoints you can use. Tsukasa & Tsumugi 48khz (placeholder).
|
|
70 |
Tsukasa was trained on ~800 hours of studio grade, high quality data. sourced mainly from games and novels, part of it from a private dataset.
|
71 |
So the Japanese is going to be the "anime japanese" (it's different than what people usually speak in real-life.)
|
72 |
|
73 |
-
For Tsumugi (placeholder) a subset of this data was used with a 48khz config; at around ~300 hours but in a more controlled manner with additional manual cleaning & annotations.
|
74 |
-
|
75 |
-
**Unfortuantely Tsumugi (48khz)'s context length is capped and that means the model will not have enough information to handle the intonations as good as Tsukasa.
|
76 |
-
it also only supports the first mode of Kotodama's inference, which means no voice design.**
|
77 |
-
|
78 |
-
|
79 |
Brought to you by:
|
80 |
|
81 |
- Soshyant (me)
|
|
|
70 |
Tsukasa was trained on ~800 hours of studio grade, high quality data. sourced mainly from games and novels, part of it from a private dataset.
|
71 |
So the Japanese is going to be the "anime japanese" (it's different than what people usually speak in real-life.)
|
72 |
|
|
|
|
|
|
|
|
|
|
|
|
|
73 |
Brought to you by:
|
74 |
|
75 |
- Soshyant (me)
|