Update README.md
Browse files
README.md
CHANGED
|
@@ -10,9 +10,7 @@
|
|
| 10 |
[Yu Qiao](https://scholar.google.com/citations?user=gFtI-8QAAAAJ&hl),
|
| 11 |
[Yali Wang†](https://scholar.google.com/citations?user=hD948dkAAAAJ)
|
| 12 |
|
| 13 |
-
[](https://arxiv.org/abs/2408.10605)
|
| 14 |
-
[](https://github.com/DINGYANB/MUSES)
|
| 15 |
-
[](https://huggingface.co/yanboding/MUSES/)
|
| 16 |
|
| 17 |
</div>
|
| 18 |
|
|
@@ -20,7 +18,7 @@
|
|
| 20 |
|
| 21 |
Despite recent advancements in text-to-image generation, most existing methods struggle to create images with multiple objects and complex spatial relationships in 3D world. To tackle this limitation, we introduce a generic AI system, namely MUSES, for 3D-controllable image generation from user queries.
|
| 22 |
|
| 23 |
-
<img width="800" alt="image" src="https://
|
| 24 |
</a>
|
| 25 |
|
| 26 |
|
|
@@ -33,7 +31,7 @@ Our MUSES realize 3D controllable image generation by developing a progressive w
|
|
| 33 |
|
| 34 |
By mimicking the collaboration of human professionals, this multi-modal agent pipeline facilitates the effective and automatic creation of images with 3D-controllable objects, through an explainable integration of top-down planning and bottom-up generation.
|
| 35 |
|
| 36 |
-
<img width="800" alt="image" src="https://
|
| 37 |
</a>
|
| 38 |
|
| 39 |
|
|
|
|
| 10 |
[Yu Qiao](https://scholar.google.com/citations?user=gFtI-8QAAAAJ&hl),
|
| 11 |
[Yali Wang†](https://scholar.google.com/citations?user=hD948dkAAAAJ)
|
| 12 |
|
| 13 |
+
[](https://arxiv.org/abs/2408.10605) [](https://github.com/DINGYANB/MUSES) [](https://huggingface.co/yanboding/MUSES/)
|
|
|
|
|
|
|
| 14 |
|
| 15 |
</div>
|
| 16 |
|
|
|
|
| 18 |
|
| 19 |
Despite recent advancements in text-to-image generation, most existing methods struggle to create images with multiple objects and complex spatial relationships in 3D world. To tackle this limitation, we introduce a generic AI system, namely MUSES, for 3D-controllable image generation from user queries.
|
| 20 |
|
| 21 |
+
<img width="800" alt="image" src="https://huggingface.co/yanboding/MUSES/blob/main/demo.png">
|
| 22 |
</a>
|
| 23 |
|
| 24 |
|
|
|
|
| 31 |
|
| 32 |
By mimicking the collaboration of human professionals, this multi-modal agent pipeline facilitates the effective and automatic creation of images with 3D-controllable objects, through an explainable integration of top-down planning and bottom-up generation.
|
| 33 |
|
| 34 |
+
<img width="800" alt="image" src="https://huggingface.co/yanboding/MUSES/blob/main/overview.png">
|
| 35 |
</a>
|
| 36 |
|
| 37 |
|