NB's picture

NB

Skier8402

AI & ML interests

Practicing Computer Vision, Optimization, NLP and multimodal system implementation.

Recent Activity

updated a Space about 15 hours ago
newhorizons/Image_Splitter
updated a Space 1 day ago
Skier8402/crewai_article_editor
updated a Space 1 day ago
newhorizons/CLL_exp_annot
View all activity

Organizations

fast.ai community's profile picture Blog-explorers's profile picture Tangu Kale Labs's profile picture UltimateControllers's profile picture blacksheepinc's profile picture Social Post Explorers's profile picture Epidemiology World's profile picture Transcriptors's profile picture Hugging Face Discord Community's profile picture

Skier8402's activity

upvoted an article 2 days ago
view article
Article

Upgrading Kokoro: natural TTS for short bursts

By hexgrad โ€ข
โ€ข 19
reacted to hexgrad's post with ๐Ÿ”ฅ 2 days ago
view post
Post
8537
๐Ÿ“ฃ Looking for labeled, high-quality synthetic audio/TTS data ๐Ÿ“ฃ Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.

If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.

What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. โค๏ธ

More details at hexgrad/Kokoro-82M#21
ยท
liked a Space 30 days ago