PetrosStav committed
Commit d112198 · 1 parent: 320326a
Update README.md
README.md CHANGED
@@ -9,4 +9,20 @@ language:
 base_model:
 - SWivid/F5-TTS
 pipeline_tag: text-to-speech
----
+---
+
+F5-TTS model fine-tuned to speak Greek.
+
+(This work is under development and currently in beta.)
+
+Fine-tuned on Greek speech datasets and a small part of the Emilia-EN dataset to prevent catastrophic forgetting of English.
+
+The model can synthesize Greek text with a Greek reference audio, English text with an English reference audio, and mixed Greek and English text (quality for mixed text still needs improvement, and several runs may be needed to get a good result).
+
+The training data consists of:
+- Common Voice 12.0 (all Greek splits)
+- Greek Single Speaker Speech (https://www.kaggle.com/datasets/bryanpark/greek-single-speaker-speech-dataset)
+- A small part of the Emilia Dataset (https://huggingface.co/datasets/amphion/Emilia-Dataset) (EN-B000049.tar)
+
+GitHub: https://github.com/SWivid/F5-TTS
+Paper: F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching