irow
/

conditional-audio-diffusion

Model card Files Files and versions Community

irow commited on Feb 20, 2023

Commit

4f3da47

·

1 Parent(s): 8391db3

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -5,3 +5,11 @@ pipeline_tag: text-to-speech
 ---
 This is a basic audio diffusion model using Unet. I've uploaded the weights and training code. The sample method of the model is used to generate whatever spoken digit you want.

 ---
 This is a basic audio diffusion model using Unet. I've uploaded the weights and training code. The sample method of the model is used to generate whatever spoken digit you want.
+![alt text](sample24_4_6.jpg "Title") ![alt text]( sample24_5_5.jpg
+ "Title") ![alt text]( sample24_6_3.jpg
+ "Title") ![alt text]( sample24_7_2.jpg
+ "Title")
+The images found in the files are sample{epoch}_{sample#}_{digit}.jpg. They also have corresponding audio files.
+The audio is VERY quiet, so turn up the speakers to hear better. (Just don't forget to turn it down after!)