schrum2
/

MarioDiffusion-GTE-multiple-negative0

TextConditionalDDPMPipeline

Model card Files Files and versions

MarioDiffusion-GTE-multiple-negative0 / README.md

schrum2's picture

Update README.md

aff4bd3 verified 8 months ago

|

history blame contribute delete

1.18 kB

	---
	license: mit
	---

	Details on the code used to produce and use this model are available at:

	https://github.com/schrum2/MarioDiffusion

	That repo has instructions to check out this model and apply it to the generation of Super Mario Bros. level scenes.
	There is also an interactive GUI for constructing complete levels out of model-generated scenes.

	This model makes use of https://huggingface.co/Alibaba-NLP/gte-large-en-v1.5
	as a text embedding model for use with diffusion to generate Mario levels.
	It makes use of negative guidance during diffusion training, and encodes each
	of several phrases within each caption as a distinct text embedding.
	Our results indicate that it has relatively poor performance, so it is made available
	only to allow full scrutiny of our results.

	To see a model using Alibaba-NLP/gte-large-en-v1.5 that performs better with
	multiple text embeddings without using negative guidance,
	see https://huggingface.co/schrum2/MarioDiffusion-GTE-multiple-regular0.
	To see a model that uses
	a simple token-based transformer model for text embedding
	and makes use of negative guidance, see https://huggingface.co/schrum2/MarioDiffusion-MLM-negative0.