license: mit
Details on the code used to produce and use this model are available at:
https://github.com/schrum2/MarioDiffusion
That repo has instructions to check out this model and apply it to the generation of Super Mario Bros. level scenes. There is also an interactive GUI for constructing complete levels out of model-generated scenes.
This model makes use of https://huggingface.co/Alibaba-NLP/gte-large-en-v1.5 as a text embedding model for use with diffusion to generate Mario levels. It makes use of slightly complicated absence style captions. It is made available to allow full scrutiny of our results, but the data indicates that this model performs poorly in comparison with others.
To see a model using Alibaba-NLP/gte-large-en-v1.5 that has reasonable performance, see https://huggingface.co/schrum2/MarioDiffusion-GTE-single-regular0. To see a model that uses absence captions with a simple token-based transformer model for text embedding, see https://huggingface.co/schrum2/MarioDiffusion-MLM-absence0.