metadata
license: apache-2.0
language:
- en
base_model:
- stabilityai/stable-diffusion-3.5-medium
pipeline_tag: text-to-image
Model Description
This is the official weight of "Efficient Multi-Instance Generation with Janus-Pro-Dirven Prompt Parsing". We propose an efficient Text-to-Image model that can generate high-quality images with reasonable layouts according to the requirements.
You can read our paper on arXiv to dive deeper into the theoretical foundations and experiments.
Todo List:
- The complete codebase will be hosted on GitHub
- The full dataset will be made available through Hugging Face Datasets
