NucleusAI
/

Nucleus-Image

@@ -12,9 +12,7 @@ tags:
 - image-generation
 ---
-<p align="center">
-    <img src="assets/logo/nucleus_header.png" width="400"/>
-</p>
 <p align="center">
     🖥️ <a href="https://github.com/WithNucleusAI/Nucleus-Image"><b>GitHub</b></a>&nbsp;&nbsp; | &nbsp;&nbsp;🤗 <a href="https://huggingface.co/NucleusAI/NucleusMoE-Image"><b>Hugging Face</b></a>&nbsp;&nbsp; | &nbsp;&nbsp;📑 <a href=""><b>Tech Report</b></a>
 </p>
@@ -34,7 +32,7 @@ tags:
 ## Architecture
-![Architecture](assets/Architecture_Diagram.png)
 Nucleus-Image is a 32-layer diffusion transformer where 29 of the 32 blocks replace the dense FFN with a sparse MoE layer containing 64 routed experts and one shared expert (the first 3 layers use dense FFN for training stability). Image queries attend to concatenated image and text key-value pairs via joint attention — text tokens are excluded from the transformer backbone entirely, participating only as KV contributors. This eliminates MoE routing overhead for text and enables full text KV caching across denoising steps.
@@ -60,7 +58,7 @@ Routing uses **Expert-Choice** with a **decoupled design**: the router receives
 ## Benchmark Results
-![Overall Performance](assets/Overall-Performance.png)
 Nucleus-Image achieves state-of-the-art or near state-of-the-art results on all three benchmarks despite activating only ~2B of its 17B parameters per forward pass. All results are from the base model at 1024x1024, 50 inference steps, CFG scale 8.0.
@@ -125,22 +123,22 @@ image.save("nucleus_output.png")
 Nucleus-Image generations of human subjects and portraits, spanning diverse cultures, ages, and artistic styles — from expressive character studies to fine-grained close-ups with intricate skin texture and detail.
-![](assets/collage/Collage-1-Top.jpeg)
-![](assets/collage/Collage-1-Bottom.jpeg)
 ### Fantasy, Surrealism & Nature
 Nucleus-Image generations spanning fantasy, surrealism, animation, and the natural world.
-![](assets/collage/Collage-2-Top.jpeg)
-![](assets/collage/Collage-2-Bottom.jpeg)
 ### Commercial & Everyday Imagery
 Nucleus-Image generations across product photography, architecture, typography, food, and world culture — demonstrating versatility in commercial, conceptual, and everyday imagery.
-![](assets/collage/Collage-3-Top.jpeg)
-![](assets/collage/Collage-3-Bottom.jpeg)
 ## License

 - image-generation
 ---
+<p align="center"> <img src="https://storage.googleapis.com/nucleus_image_v1/nucleus_header.png" width="400"/></p>
 <p align="center">
     🖥️ <a href="https://github.com/WithNucleusAI/Nucleus-Image"><b>GitHub</b></a>&nbsp;&nbsp; | &nbsp;&nbsp;🤗 <a href="https://huggingface.co/NucleusAI/NucleusMoE-Image"><b>Hugging Face</b></a>&nbsp;&nbsp; | &nbsp;&nbsp;📑 <a href=""><b>Tech Report</b></a>
 </p>
 ## Architecture
+![Architecture](https://storage.googleapis.com/nucleus_image_v1/Architecture_Diagram.png)
 Nucleus-Image is a 32-layer diffusion transformer where 29 of the 32 blocks replace the dense FFN with a sparse MoE layer containing 64 routed experts and one shared expert (the first 3 layers use dense FFN for training stability). Image queries attend to concatenated image and text key-value pairs via joint attention — text tokens are excluded from the transformer backbone entirely, participating only as KV contributors. This eliminates MoE routing overhead for text and enables full text KV caching across denoising steps.
 ## Benchmark Results
+![Overall Performance](https://storage.googleapis.com/nucleus_image_v1/Overall-Performance.png)
 Nucleus-Image achieves state-of-the-art or near state-of-the-art results on all three benchmarks despite activating only ~2B of its 17B parameters per forward pass. All results are from the base model at 1024x1024, 50 inference steps, CFG scale 8.0.
 Nucleus-Image generations of human subjects and portraits, spanning diverse cultures, ages, and artistic styles — from expressive character studies to fine-grained close-ups with intricate skin texture and detail.
+![](https://storage.googleapis.com/nucleus_image_v1/Collage-1-Top.jpeg)
+![](https://storage.googleapis.com/nucleus_image_v1/Collage-1-Bottom.jpeg)
 ### Fantasy, Surrealism & Nature
 Nucleus-Image generations spanning fantasy, surrealism, animation, and the natural world.
+![](https://storage.googleapis.com/nucleus_image_v1/Collage-2-Top.jpeg)
+![](https://storage.googleapis.com/nucleus_image_v1/Collage-2-Bottom.jpeg)
 ### Commercial & Everyday Imagery
 Nucleus-Image generations across product photography, architecture, typography, food, and world culture — demonstrating versatility in commercial, conceptual, and everyday imagery.
+![](https://storage.googleapis.com/nucleus_image_v1/Collage-3-Top.jpeg)
+![](https://storage.googleapis.com/nucleus_image_v1/Collage-3-Bottom.jpeg)
 ## License