File size: 249 Bytes
9a7c45c
 
 
 
8f3ccae
 
 
1
2
3
4
5
6
7
8
---
datasets:
- flexthink/audiomnist
pipeline_tag: text-to-speech
---

This is a basic audio diffusion model using Unet. I've uploaded the weights and training code. The sample method of the model is used to generate whatever spoken digit you want.