Update README.md
README.md
CHANGED
@@ -5,3 +5,11 @@ pipeline_tag: text-to-speech
---
This is a basic audio diffusion model built on a U-Net. I've uploaded the weights and training code. Use the model's sample method to generate whichever spoken digit you want.
+
+The image files are named sample{epoch}_{sample#}_{digit}.jpg. Each one has a corresponding audio file.
+The audio is VERY quiet, so turn up your speakers to hear it. (Just don't forget to turn them back down afterward!)
+
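As a rough illustration of the naming scheme above, the sketch below splits a sample filename into its epoch, sample number, and digit. It assumes the epoch and sample number are plain integers and the digit is a single character from 0 to 9. `SampleName` and `parse_sample_name` are hypothetical names for this example and are not part of the uploaded training code.

```python
import re
from typing import NamedTuple, Optional


class SampleName(NamedTuple):
    """Fields recovered from a sample{epoch}_{sample#}_{digit}.<ext> filename."""
    epoch: int
    sample: int
    digit: int
    ext: str


# Assumption: integer epoch and sample number, one digit 0-9, any extension.
_SAMPLE_PATTERN = re.compile(r"^sample(\d+)_(\d+)_(\d)\.(\w+)$")


def parse_sample_name(filename: str) -> Optional[SampleName]:
    """Return the parsed fields, or None if the name does not fit the scheme."""
    match = _SAMPLE_PATTERN.match(filename)
    if match is None:
        return None
    epoch, sample, digit, ext = match.groups()
    return SampleName(int(epoch), int(sample), int(digit), ext)
```

For example, `parse_sample_name("sample10_3_7.jpg")` would give epoch 10, sample 3, digit 7. Because the extension is captured separately, the same helper could be used to pair an image with its audio file, whatever extension the audio uses.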