amuse / README.md

Create README.md

eb3bcd5 verified over 1 year ago

308 Bytes

This is a conditional unet model designed for music generation using mel spectrogram images. The model was trained on the alppo/music dataset, which includes 5 different genres. It accepts 512x512 images and 1x64 condition embeddings, which can be obtained from my own variational autoencoder implementation.