conditional-audio-diffusion / README.md

irow

Update README.md

8f3ccae almost 3 years ago

preview code

raw

history blame

249 Bytes

metadata

datasets:
  - flexthink/audiomnist
pipeline_tag: text-to-speech

This is a basic audio diffusion model using Unet. I've uploaded the weights and training code. The sample method of the model is used to generate whatever spoken digit you want.