irow
/

conditional-audio-diffusion

Model card Files Files and versions

conditional-audio-diffusion / README.md

irow's picture

Update README.md

8f3ccae almost 3 years ago

|

249 Bytes

	---
	datasets:
	- flexthink/audiomnist
	pipeline_tag: text-to-speech
	---

	This is a basic audio diffusion model using Unet. I've uploaded the weights and training code. The sample method of the model is used to generate whatever spoken digit you want.