This is another dance diffusion AI similar to Blunstron, except this time it generates harmonizing voices, and sings in either E or G half sharp. I finetuned it from Blunstron on a dataset of 3-second clips of Jacob Collier - In The Bleak Midwinter (which like Blunstron's dataset of The Alan Parsons Project - Old and Wise, I have sampled dozens of times in my "audio collage" music). However the samples were too quiet for my tastes and always sung in F, because it latched on to the beginning of the song. So I halted the training, removed all the quieter parts from the dataset, then resumed from where I left off so it could keep the knowledge it got already. This time I decided to preserve the entire history of checkpoints, since each once is unique.

Checkpoints

epoch=948-step=192000.ckpt is probably the best one, and it's the one I'll probably use. It sings in E and G half sharp, is varied from the original dataset, and is sometimes composed and sometimes wild, for the variety.
epoch=1019-step=192500.ckpt seems to be overfit, since a lot of the stuff it generates I can immediately hear where in the song it came from, from memory alone. Its perfect sound doesn't leave much room for creativity and also definitely qualifies as art theft. Like the previous, it sings in E and G half sharp.
epoch=876-step=191500.ckpt is from before the deletion of most of the dataset. It sings in F and is relatively quiet and composed.
epoch=845-step=191000.ckpt I have no training samples from, and haven't tested. It's quite likely it might share some aspects with Blunstron.

Downloads last month: -

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kronosta/ColliErgoSum

Base model

harmonai/jmann-small-190k

Finetuned

kronosta/blunstron

Finetuned

(1)

this model