---
license: apache-2.0
language:
- en
base_model:
- stabilityai/stable-diffusion-3.5-medium
pipeline_tag: text-to-image
---
# Model Description

This repository hosts the official weights for "Efficient Multi-Instance Generation with Janus-Pro-Driven Prompt Parsing". We propose an efficient text-to-image model that generates high-quality multi-instance images whose layouts follow the requirements given in the prompt.

For the theoretical foundations and experiments, see our paper on [arXiv](https://arxiv.org/abs/2503.21069).



# Todo List

- The complete codebase will be hosted on GitHub
- The full dataset will be made available through Hugging Face Datasets