<div align="center">
<h1>Fish Speech</h1>

[English](../README.md) | [简体中文](README.zh.md) | [Portuguese](README.pt-BR.md) | [日本語](README.ja.md) | **한국어** <br>

<a href="https://www.producthunt.com/posts/fish-speech-1-4?embed=true&utm_source=badge-featured&utm_medium=badge&utm_souce=badge-fish-speech-1-4" target="_blank">
<img src="https://api.producthunt.com/widgets/embed-image/v1/featured.svg?post_id=488440&theme=light" alt="Fish Speech 1.4 - Open-Source Multilingual Text-to-Speech with Voice Cloning | Product Hunt" style="width: 250px; height: 54px;" width="250" height="54" />
</a>
<a href="https://trendshift.io/repositories/7014" target="_blank">
<img src="https://trendshift.io/api/badge/repositories/7014" alt="fishaudio%2Ffish-speech | Trendshift" style="width: 250px; height: 55px;" width="250" height="55"/>
</a>
<br>
</div>
<br>
<div align="center">
<img src="https://count.getloli.com/get/@fish-speech?theme=asoul" /><br>
</div>
<br>
<div align="center">
<a target="_blank" href="https://discord.gg/Es5qTB9BcN">
<img alt="Discord" src="https://img.shields.io/discord/1214047546020728892?color=%23738ADB&label=Discord&logo=discord&logoColor=white&style=flat-square"/>
</a>
<a target="_blank" href="https://hub.docker.com/r/fishaudio/fish-speech">
<img alt="Docker" src="https://img.shields.io/docker/pulls/fishaudio/fish-speech?style=flat-square&logo=docker"/>
</a>
<a target="_blank" href="https://huggingface.co/spaces/fishaudio/fish-speech-1">
<img alt="Huggingface" src="https://img.shields.io/badge/🤗%20-space%20demo-yellow"/>
</a>
</div>

This codebase and all models are released under the CC-BY-NC-SA-4.0 license. See [LICENSE](LICENSE) for details.

---

## Features

1. **Zero-shot & Few-shot TTS:** Input a 10 to 30-second voice sample to generate high-quality TTS output. **For detailed guidelines, see [Voice Cloning Best Practices](https://docs.fish.audio/text-to-speech/voice-clone-best-practices).**
2. **Multilingual & Cross-lingual Support:** Copy and paste multilingual text into the input box without worrying about the language. Currently supports English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish.
3. **No Phoneme Dependency:** The model generalizes strongly and does not rely on phonemes for TTS. It can handle text in any language script.
4. **Highly Accurate:** Achieves a character error rate (CER) and word error rate (WER) of only 2% on 5-minute English texts.
5. **Fast:** With fish-tech acceleration, the real-time factor (RTF) is approximately 1:5 on an Nvidia RTX 4060 laptop and 1:15 on an Nvidia RTX 4090.
6. **Web UI Inference:** Provides an easy-to-use, Gradio-based web UI compatible with Chrome, Firefox, Edge, and other browsers.
7. **GUI Inference:** Provides a PyQt6 graphical interface that works seamlessly with the API server. Supports Linux, Windows, and macOS. [See GUI](https://github.com/AnyaCoder/fish-speech-gui).
8. **Deploy-Friendly:** Easily set up an inference server with native support for Linux, Windows, and macOS, minimizing speed loss.
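
The accuracy and speed figures in items 4 and 5 are ratios, and a small sketch can make them concrete. This README does not describe how the numbers were measured, so everything below is an illustration under stated assumptions rather than the project's evaluation code: CER and WER use the common edit-distance definition, and a ratio such as 1:5 is read as one second of compute per five seconds of generated audio. The function names are hypothetical and are not part of Fish Speech.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists).

    Insertions, deletions, and substitutions each cost 1.
    """
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,             # delete r from the reference
                curr[j - 1] + 1,         # insert h into the reference
                prev[j - 1] + (r != h),  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance / reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def audio_per_compute_second(audio_seconds, compute_seconds):
    """Seconds of audio produced per second of compute.

    Assumption: a ratio written as 1:5 corresponds to a return value of 5.
    """
    return audio_seconds / compute_seconds
```

For example, `wer("the quick brown fox", "the quick brown box")` is 0.25 (one wrong word out of four), and `audio_per_compute_second(10.0, 2.0)` is 5.0, which would be written as 1:5 under the reading assumed here.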

## Disclaimer

We accept no responsibility for any illegal use of this codebase. Please refer to your local laws regarding the DMCA and other related legislation.

## Online Demo

[Fish Audio](https://fish.audio)

## Quick Start for Local Inference

[inference.ipynb](/inference.ipynb)

## Videos

#### V1.4 Demo Video: [Youtube](https://www.youtube.com/watch?v=Ghc8cJdQyKQ)

## Documents

- [English](https://speech.fish.audio/)
- [中文](https://speech.fish.audio/zh/)
- [日本語](https://speech.fish.audio/ja/)
- [Portuguese (Brazil)](https://speech.fish.audio/pt/)
- [한국어](https://speech.fish.audio/ko/)

## Samples (2024/10/02 V1.4)

- [English](https://speech.fish.audio/samples/)
- [中文](https://speech.fish.audio/zh/samples/)
- [日本語](https://speech.fish.audio/ja/samples/)
- [Portuguese (Brazil)](https://speech.fish.audio/pt/samples/)
- [한국어](https://speech.fish.audio/ko/samples/)

## Credits

- [VITS2 (daniilrobnikov)](https://github.com/daniilrobnikov/vits2)
- [Bert-VITS2](https://github.com/fishaudio/Bert-VITS2)
- [GPT VITS](https://github.com/innnky/gpt-vits)
- [MQTTS](https://github.com/b04901014/MQTTS)
- [GPT Fast](https://github.com/pytorch-labs/gpt-fast)
- [GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS)

## Sponsor

<div>
<a href="https://6block.com/">
<img src="https://avatars.githubusercontent.com/u/60573493" width="100" height="100" alt="6Block Avatar"/>
</a>
<br>
<a href="https://6block.com/">Data processing sponsor: 6Block</a>
</div>
<div>
<a href="https://www.lepton.ai/">
<img src="https://www.lepton.ai/favicons/apple-touch-icon.png" width="100" height="100" alt="Lepton Avatar"/>
</a>
<br>
<a href="https://www.lepton.ai/">Fish Audio is served on Lepton.AI</a>
</div>