---
license: apache-2.0
language:
- en
- zh
pipeline_tag: text-to-image
library_name: diffusers
tags:
- diffusers
- comfyui
- quantization
- quant
- fp8
---

This repository contains FP8_E5M2 and FP8_E4M3FN quantizations of [Comfy-Org/z_image_turbo](https://huggingface.co/Comfy-Org/z_image_turbo/tree/main).

| Precision | Image 1 | Image 2 |
|-----------|---------|---------|
| bf16 | <img src="https://cdn-uploads.huggingface.co/production/uploads/63473b59e5c0717e6737b872/urBHqJrWreaZ0cXc4dvUN.png" width="99%"> | <img src="https://cdn-uploads.huggingface.co/production/uploads/63473b59e5c0717e6737b872/3TXwtKh7SIkOUI8ZmkQJa.png" width="99%"> |
| fp8_e4m3fn | <img src="https://cdn-uploads.huggingface.co/production/uploads/63473b59e5c0717e6737b872/h-uoIK9GqHwYzCwaFjais.png" width="99%"> | <img src="https://cdn-uploads.huggingface.co/production/uploads/63473b59e5c0717e6737b872/tIr-8lkIcASLL7owsMKxn.png" width="99%"> |
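
The two FP8 formats named above trade precision for range differently: E4M3FN keeps 3 mantissa bits and tops out at 448, while E5M2 keeps 2 mantissa bits and reaches 57344. The following is a minimal pure-Python sketch of that trade-off, not the code used to produce these checkpoints. `FORMATS` and `round_to_fp8` are hypothetical names introduced for illustration, and the sketch assumes overflow saturates to the largest finite value, whereas a real cast may yield NaN or infinity instead.

```python
import math

# (exponent bits, mantissa bits, largest finite value) for each format.
FORMATS = {
    "fp8_e4m3fn": (4, 3, 448.0),
    "fp8_e5m2": (5, 2, 57344.0),
}


def round_to_fp8(x: float, fmt: str) -> float:
    """Round x to the nearest value representable in the given FP8 format."""
    exp_bits, man_bits, max_finite = FORMATS[fmt]
    if x == 0.0 or math.isnan(x):
        return x
    # Assumption: saturate on overflow instead of producing NaN or infinity.
    magnitude = min(abs(x), max_finite)
    bias = 2 ** (exp_bits - 1) - 1
    min_exp = 1 - bias  # exponent of the smallest normal number
    # frexp gives magnitude = m * 2**e with 0.5 <= m < 1,
    # so floor(log2(magnitude)) equals e - 1. Subnormals share min_exp.
    exponent = max(math.frexp(magnitude)[1] - 1, min_exp)
    step = 2.0 ** (exponent - man_bits)  # spacing between neighbouring values
    rounded = round(magnitude / step) * step  # round() breaks ties to even
    return math.copysign(rounded, x)


for value in (0.27, 1000.0):
    for fmt in FORMATS:
        print(f"{value} -> {fmt}: {round_to_fp8(value, fmt)}")
```

Under these assumptions, 0.27 lands on 0.28125 in E4M3FN but on 0.25 in E5M2, while 1000.0 saturates to 448.0 in E4M3FN and rounds to 1024.0 in E5M2.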

<h1 align="center">⚡️- Image<br><sub><sup>An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer</sup></sub></h1>

<div align="center">

[Official Site](https://tongyi-mai.github.io/Z-Image-homepage/)&#160;
[GitHub](https://github.com/Tongyi-MAI/Z-Image)&#160;
[Hugging Face Checkpoint](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo)&#160;
[Hugging Face Online Demo](https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo)&#160;
[ModelScope Model](https://www.modelscope.cn/models/Tongyi-MAI/Z-Image-Turbo)&#160;
[ModelScope Online Demo](https://www.modelscope.cn/aigc/imageGeneration?tab=advanced&versionId=469191&modelType=Checkpoint&sdVersion=Z_IMAGE_TURBO&modelUrl=modelscope%3A%2F%2FTongyi-MAI%2FZ-Image-Turbo%3Frevision%3Dmaster)&#160;
[Art Gallery (PDF)](assets/Z-Image-Gallery.pdf)&#160;
[Web Art Gallery](https://modelscope.cn/studios/Tongyi-MAI/Z-Image-Gallery/summary)&#160;
<a href="http://github.com/Tongyi-MAI/Z-Image/blob/main/Z_Image_Report.pdf" target="_blank"><img src="https://img.shields.io/badge/Report-b5212f.svg?logo=arxiv" height="21px"></a>

Welcome to the official repository for the Z-Image (造相) project!

</div>

## ✨ Z-Image

Z-Image is a powerful and highly efficient image generation model with **6B** parameters. It currently has three variants:

- 🚀 **Z-Image-Turbo** – A distilled version of Z-Image that matches or exceeds leading competitors with only **8 NFEs** (Number of Function Evaluations). It offers **⚡️sub-second inference latency⚡️** on enterprise-grade H800 GPUs and fits comfortably on **consumer devices with 16 GB of VRAM**. It excels in photorealistic image generation, bilingual text rendering (English & Chinese), and robust instruction adherence.
- 🧱 **Z-Image-Base** – The non-distilled foundation model. By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.
- ✍️ **Z-Image-Edit** – A variant of Z-Image fine-tuned specifically for image editing tasks. It supports creative image-to-image generation with impressive instruction-following capabilities, allowing for precise edits based on natural language prompts.
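
As a rough illustration of why FP8 weights matter for the 16 GB VRAM figure, the sketch below estimates weight storage alone for a 6B-parameter model at each precision. It assumes exactly 6e9 parameters and ignores the text encoder, VAE, activations, and any layers kept in higher precision, so real memory use will differ. `BYTES_PER_PARAM` and `weight_gib` are hypothetical names introduced for illustration.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"bf16": 2, "fp8_e4m3fn": 1, "fp8_e5m2": 1}


def weight_gib(num_params: float, dtype: str) -> float:
    """Return the storage needed for the weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


# Assumption: a round 6e9 parameters, taken from the "6B" figure above.
for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {weight_gib(6e9, dtype):.1f} GiB")
```

With these assumptions the bf16 weights come to about 11.2 GiB and either FP8 format to about 5.6 GiB, which is half the size.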

### 📥 Model Zoo

| Model | Hugging Face | ModelScope |
| :--- | :--- | :--- |
| **Z-Image-Turbo** | [Checkpoint](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo) <br> [Online Demo](https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo) | [Checkpoint](https://www.modelscope.cn/models/Tongyi-MAI/Z-Image-Turbo) <br> [Online Demo](https://www.modelscope.cn/aigc/imageGeneration?tab=advanced&versionId=469191&modelType=Checkpoint&sdVersion=Z_IMAGE_TURBO&modelUrl=modelscope%3A%2F%2FTongyi-MAI%2FZ-Image-Turbo%3Frevision%3Dmaster) |
| **Z-Image-Base** | *To be released* | *To be released* |
| **Z-Image-Edit** | *To be released* | *To be released* |