---
license: mit
language:
- en
- ru
- zh
tags:
- deepfake
- voice cloning
- tacotron2
- waveglow
- ebsynth
---
[![License: MIT](https://img.shields.io/badge/license-MIT-blue.svg)](https://github.com/wladradchenko/wunjo.wladradchenko.ru/blob/main/LICENSE)

<div id="top"></div>

<br />
<div align="center">

  <a href="https://github.com/wladradchenko/wunjo.wladradchenko.ru">
    <img src="https://raw.githubusercontent.com/wladradchenko/wunjo.wladradchenko.ru/main/example/man.gif" alt="Logo" width="180" height="180">
  </a>
    
  <h3 align="center">Wunjo AI: Advanced Speech & Deepfake Neural Network Tool</h3>

  <p align="center">
    <a href="https://github.com/wladradchenko/wunjo.wladradchenko.ru/wiki">Documentation</a>
    <br/>
    <a href="https://github.com/wladradchenko/wunjo.wladradchenko.ru/issues">Issue</a>
    ·
    <a href="https://github.com/wladradchenko/wunjo.wladradchenko.ru/discussions">Discussions</a>
    ·
    <a href="https://youtube.com/playlist?list=PLJG0sD6007zFJyV78mkU-KW2UxbirgTGr&feature=shared">Tutorial</a>
  </p>
</div>


<!-- ABOUT THE PROJECT -->
## About

This repository is a reserve cloud store for the AI models used by Wunjo AI. Unlock the capabilities of neural networks with Wunjo AI. Whether you're delving into speech synthesis, crafting deepfake animations, generating video with Stable Diffusion from a text prompt, or editing video, Wunjo AI has got you covered.

**What is kept in this repo?**

- **Speech Synthesis:** Tacotron2 and WaveGlow [models](https://huggingface.co/wladradchenko/wunjo.wladradchenko.ru/tree/main/voice) for speech synthesis.
- **Voice Cloning:** Encoder, vocoder, and synthesizer [models](https://huggingface.co/wladradchenko/wunjo.wladradchenko.ru/tree/main/rtvc) for voice cloning.
- **Video-to-Video by Text Prompt:** Ebsynth build for CUDA 11.8 on [Linux](https://huggingface.co/wladradchenko/wunjo.wladradchenko.ru/resolve/main/ebsynth/ebsynth_linux_cu118) and [Windows](https://huggingface.co/wladradchenko/wunjo.wladradchenko.ru/resolve/main/ebsynth/EbSynth-Beta-Win.zip).
- **Deepfake Animation:**
  - [Model](https://huggingface.co/wladradchenko/wunjo.wladradchenko.ru/resolve/main/deepfake/faceswap.onnx) to swap faces in videos, GIFs, and photos using just a single photograph.
  - [Model](https://huggingface.co/wladradchenko/wunjo.wladradchenko.ru/resolve/main/deepfake/emo2lip.pth) to change the emotions of a person in a video from a text description, built on the Wav2Lip architecture.
- **AI Retouch Tool:** Models to [remove objects](https://huggingface.co/wladradchenko/wunjo.wladradchenko.ru/resolve/main/deepfake/retouch_object.pth) from a video or image and to [retouch faces](https://huggingface.co/wladradchenko/wunjo.wladradchenko.ru/resolve/main/deepfake/retouch_face.pth).
- **Automatic Segmentation Mask:** Models in ONNX format for segmentation [small](https://huggingface.co/wladradchenko/wunjo.wladradchenko.ru/resolve/main/deepfake/vit_b_quantized.onnx) and [large](https://huggingface.co/wladradchenko/wunjo.wladradchenko.ru/resolve/main/deepfake/vit_h_quantized.onnx).
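
The direct links in the list above all follow the same `resolve/main/<path>` pattern, so individual files can be fetched from a script. Below is a minimal sketch using only the Python standard library. The names `MODEL_FILES`, `resolve_url`, and `download_model` are hypothetical helpers introduced for illustration and are not part of Wunjo AI; the file paths are copied from the links above, and the `models` target directory is an assumption.

```python
from pathlib import Path
from urllib.parse import quote
from urllib.request import urlretrieve

# Repository and branch as they appear in the links above.
REPO_ID = "wladradchenko/wunjo.wladradchenko.ru"
REVISION = "main"

# Hypothetical short names mapped to file paths taken from the list above.
MODEL_FILES = {
    "faceswap": "deepfake/faceswap.onnx",
    "emo2lip": "deepfake/emo2lip.pth",
    "retouch_object": "deepfake/retouch_object.pth",
    "retouch_face": "deepfake/retouch_face.pth",
    "segmentation_small": "deepfake/vit_b_quantized.onnx",
    "segmentation_large": "deepfake/vit_h_quantized.onnx",
}


def resolve_url(filename: str, repo_id: str = REPO_ID, revision: str = REVISION) -> str:
    """Build the direct-download URL for a file stored in the repository."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{quote(filename)}"


def download_model(name: str, target_dir: str = "models") -> Path:
    """Download one model by short name, skipping files that already exist."""
    filename = MODEL_FILES[name]
    target = Path(target_dir) / filename
    target.parent.mkdir(parents=True, exist_ok=True)
    if not target.exists():
        # Network call: fetches the file and writes it to the target path.
        urlretrieve(resolve_url(filename), str(target))
    return target
```

Under these assumptions, `download_model("faceswap")` would save the face swap model to `models/deepfake/faceswap.onnx`. The `huggingface_hub` package offers `hf_hub_download(repo_id=..., filename=...)` as an alternative that adds caching.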

**Applications:**
From voiceovers in commercials to character voicing in games, from audiobook narrations to fun deepfake projects, Wunjo AI offers endless possibilities, and it is all free and runs locally on your device.

**Why Choose Wunjo AI?**

- **All-in-One:** A comprehensive tool catering to both your voice and visual AI needs.
- **User-friendly:** Designed for all, from beginners to professionals.
- **Privacy First:** Functions locally on your desktop, ensuring your data remains private.
- **Open-source & Free:** Benefit from community-driven enhancements and enjoy the app without any cost.

Step into the future of AI-powered creativity with Wunjo AI.

<!-- DONAT -->
## Support the Project

You can support the author in developing his creative ideas, or just treat him to [a cup of coffee](https://www.buymeacoffee.com/wladradchenko) in USD or [a slice of pizza](https://wladradchenko.ru/donat) in RUB. There are other ways to support the project; see the [Support the Project page](https://github.com/wladradchenko/wunjo.wladradchenko.ru/wiki/Support-the-Project) for details.

<div align="center">
  <table>
  <tr>
    <th>Buy a cup of coffee in USD</th>
    <th>Buy a slice of pizza in RUB</th>
  </tr>
  <tr align="center">
    <td><img src="https://github.com/wladradchenko/wunjo.wladradchenko.ru/assets/56233697/bc6eefa2-705f-4307-89fd-85d96ec29917" alt="pizza" width="250"></td>
    <td><img src="https://github.com/wladradchenko/wunjo.wladradchenko.ru/assets/56233697/acc80acd-0e39-4476-88db-0a10f2098e25" alt="coffee" width="250"></td>
  </tr>
</table>
</div>

<!-- CONTACT -->
## Contact

Owner: [Wladislav Radchenko](https://github.com/wladradchenko/)

Email: [[email protected]]([email protected])

Project on GitHub: [https://github.com/wladradchenko/wunjo.wladradchenko.ru](https://github.com/wladradchenko/wunjo.wladradchenko.ru)

Web site: [wladradchenko.ru/wunjo](https://wladradchenko.ru/wunjo)

<!-- PREMISE -->
## Premise

Wunjo comes from the ancient runic alphabet and represents joy and contentment, which ties into the idea of using the application to create engaging and expressive speech. Wunjo (ᚹ) is the eighth rune of the Elder and Anglo-Saxon Futhark. Before the letter W was introduced into the Latin alphabet, English used the letter Ƿynn (Ƿƿ), derived from this rune, in its place.

<!-- CREDITS -->
## Credits

* Tacotron 2 - https://github.com/NVIDIA/tacotron2
* WaveGlow - https://github.com/NVIDIA/waveglow
* Real-Time Voice Cloning - https://github.com/CorentinJ/Real-Time-Voice-Cloning
* Segment Anything - https://github.com/facebookresearch/segment-anything
* Ebsynth - https://github.com/jamriska/ebsynth
  
<p align="right">(<a href="#top">to top</a>)</p>