shimmyshimmer commited on Feb 12

Commit

683eadd

0 Parent(s):

Duplicate from unsloth/Z-Image-GGUF

Browse files

Co-authored-by: Michael Han <shimmyshimmer@users.noreply.huggingface.co>

Files changed (19) hide show

.gitattributes +52 -0
README.md +147 -0
teaser.jpg +3 -0
z-image-BF16.gguf +3 -0
z-image-F16.gguf +3 -0
z-image-Q2_K.gguf +3 -0
z-image-Q3_K_L.gguf +3 -0
z-image-Q3_K_M.gguf +3 -0
z-image-Q3_K_S.gguf +3 -0
z-image-Q4_0.gguf +3 -0
z-image-Q4_1.gguf +3 -0
z-image-Q4_K_M.gguf +3 -0
z-image-Q4_K_S.gguf +3 -0
z-image-Q5_0.gguf +3 -0
z-image-Q5_1.gguf +3 -0
z-image-Q5_K_M.gguf +3 -0
z-image-Q5_K_S.gguf +3 -0
z-image-Q6_K.gguf +3 -0
z-image-Q8_0.gguf +3 -0

.gitattributes ADDED Viewed

	@@ -0,0 +1,52 @@

+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+teaser.jpg filter=lfs diff=lfs merge=lfs -text
+z-image-F16.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-BF16.gguf filter=lfs diff=lfs merge=lfs -text
+z-image-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,147 @@

+---
+base_model: Tongyi-MAI/Z-Image
+license: apache-2.0
+language:
+- en
+pipeline_tag: text-to-image
+library_name: ggml
+tags:
+- gguf
+- unsloth
+- quantized
+---
+This is a GGUF quantized version of [Z-Image](https://huggingface.co/Tongyi-MAI/Z-Image). <br>
+unsloth/Z-Image-GGUF uses [Unsloth Dynamic 2.0](https://docs.unsloth.ai/basics/unsloth-dynamic-2.0-ggufs) methodology for SOTA performance.
+- Important layers are upcasted to higher precision.
+- Uses tooling from [ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF) by city96.
+<div>
+  <div style="display: flex; gap: 5px; align-items: center; ">
+    <a href="https://github.com/unslothai/unsloth/">
+      <img src="https://github.com/unslothai/unsloth/raw/main/images/unsloth%20new%20logo.png" width="133">
+    </a>
+    <a href="https://discord.gg/unsloth">
+      <img src="https://github.com/unslothai/unsloth/raw/main/images/Discord%20button.png" width="173">
+    </a>
+        <a href="https://docs.unsloth.ai/basics/unsloth-dynamic-2.0-ggufs">
+      <img src="https://raw.githubusercontent.com/unslothai/unsloth/refs/heads/main/images/documentation%20green%20button.png" width="143">
+    </a>
+  </div>
+</div>
+---
+<h1 align="center">⚡️- Image<br><sub><sup>An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer</sup></sub></h1>
+<div align="center">
+[![Official Site](https://img.shields.io/badge/Official%20Site-333399.svg?logo=homepage)](https://tongyi-mai.github.io/Z-Image-blog/)&#160;
+[![GitHub](https://img.shields.io/badge/GitHub-Z--Image-181717?logo=github&logoColor=white)](https://github.com/Tongyi-MAI/Z-Image)&#160;
+[![Hugging Face](https://img.shields.io/badge/%F0%9F%A4%97%20Checkpoint-Z--Image-yellow)](https://huggingface.co/Tongyi-MAI/Z-Image)&#160;
+[![ModelScope Model](https://img.shields.io/badge/🤖%20Checkpoint-Z--Image-624aff)](https://www.modelscope.cn/models/Tongyi-MAI/Z-Image)&#160;
+[![ModelScope Space](https://img.shields.io/badge/🤖%20Online_Demo-Z--Image-17c7a7)](https://www.modelscope.cn/aigc/imageGeneration?tab=advanced&versionId=569345&modelType=Checkpoint&sdVersion=Z_IMAGE&modelUrl=modelscope%3A%2F%2FTongyi-MAI%2FZ-Image%3Frevision%3Dmaster)&#160;
+<a href="https://arxiv.org/abs/2511.22699" target="_blank"><img src="https://img.shields.io/badge/Report-b5212f.svg?logo=arxiv" height="21px"></a>
+Welcome to the official repository for the Z-Image（造相）project!
+</div>
+## 🎨 Z-Image
+![Teaser](teaser.jpg)
+![asethetic](https://cdn-uploads.huggingface.co/production/uploads/64379d79fac5ea753f1c10f3/RftwBF4PzC0_L9GvETPZz.jpeg)
+![diverse](https://cdn-uploads.huggingface.co/production/uploads/64379d79fac5ea753f1c10f3/HiFeAD2XUTmlxgdWHwhss.jpeg)
+![negative](https://cdn-uploads.huggingface.co/production/uploads/64379d79fac5ea753f1c10f3/rECmhpZys1siGgEO8L6Fi.jpeg)
+**Z-Image** is the foundation model of the ⚡️- Image family, engineered for good quality, robust generative diversity, broad stylistic coverage, and precise prompt adherence.
+While Z-Image-Turbo is built for speed,
+Z-Image is a full-capacity, undistilled transformer designed to be the backbone for creators, researchers, and developers who require the highest level of creative freedom.
+![z-image](https://cdn-uploads.huggingface.co/production/uploads/64379d79fac5ea753f1c10f3/kt_A-s5vMQ6L-_sUjNUCG.jpeg)
+### 🌟 Key Features
+- **Undistilled Foundation**: As a non-distilled base model, Z-Image preserves the complete training signal. It supports full Classifier-Free Guidance (CFG), providing the precision required for complex prompt engineering and professional workflows.
+- **Aesthetic Versatility**: Z-Image masters a vast spectrum of visual languages—from hyper-realistic photography and cinematic digital art to intricate anime and stylized illustrations. It is the ideal engine for scenarios requiring rich, multi-dimensional expression.
+- **Enhanced Output Diversity**: Built for exploration, Z-Image delivers significantly higher variability in composition, facial identity, and lighting across different seeds, ensuring that multi-person scenes remain distinct and dynamic.
+- **Built for Development**: The ideal starting point for the community. Its non-distilled nature makes it a good base for LoRA training, structural conditioning (ControlNet) and semantic conditioning.
+- **Robust Negative Control**: Responds with high fidelity to negative prompting, allowing users to reliably suppress artifacts and adjust compositions.
+### 🆚 Z-Image vs Z-Image-Turbo
+| Aspect | Z-Image | Z-Image-Turbo |
+|------|------|------|
+| CFG | ✅ | ❌ |
+| Steps | 28~50 | 8 |
+| Fintunablity | ✅ | ❌ |
+| Negative Prompting | ✅ | ❌ |
+| Diversity | High | Low |
+| Visual Quality | High | Very High |
+| RL | ❌ | ✅ |
+## 🚀 Quick Start
+### Installation & Download
+Install the latest version of diffusers:
+```bash
+pip install git+https://github.com/huggingface/diffusers
+```
+Download the model:
+```bash
+pip install -U huggingface_hub
+HF_XET_HIGH_PERFORMANCE=1 hf download Tongyi-MAI/Z-Image
+```
+### Recommended Parameters
+- **Resolution:** 512×512 to 2048×2048 (total pixel area, any aspect ratio)
+- **Guidance scale:** 3.0 – 5.0
+- **Inference steps:** 28 – 50
+### Usage Example
+```python
+import torch
+from diffusers import ZImagePipeline
+# Load the pipeline
+pipe = ZImagePipeline.from_pretrained(
+    "Tongyi-MAI/Z-Image",
+    torch_dtype=torch.bfloat16,
+    low_cpu_mem_usage=False,
+)
+pipe.to("cuda")
+# Generate image
+prompt = "两名年轻亚裔女性紧密站在一起，背景为朴素的灰色纹理墙面，可能是室内地毯地面。左侧女性留着长卷发，身穿藏青色毛衣，左袖有奶油色褶皱装饰，内搭白色立领衬衫，下身白色裤子；佩戴小巧金色耳钉，双臂交叉于背后。右侧女性留直肩长发，身穿奶油色卫衣，胸前印有“Tun the tables”字样，下方为“New ideas”，搭配白色裤子；佩戴银色小环耳环，双臂交叉于胸前。两人均面带微笑直视镜头。照片，自然光照明，柔和阴影，以藏青、奶油白为主的中性色调，休闲时尚摄影，中等景深，面部和上半身对焦清晰，姿态放松，表情友好，室内环境，地毯地面，纯色背景。"
+negative_prompt = "" # Optional, but would be powerful when you want to remove some unwanted content
+image = pipe(
+    prompt=prompt,
+    negative_prompt=negative_prompt,
+    height=1280,
+    width=720,
+    cfg_normalization=False,
+    num_inference_steps=50,
+    guidance_scale=4,
+    generator=torch.Generator("cuda").manual_seed(42),
+).images[0]
+image.save("example.png")
+```
+## 📜 Citation
+If you find our work useful in your research, please consider citing:
+```bibtex
+@article{team2025zimage,
+  title={Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer},
+  author={Z-Image Team},
+  journal={arXiv preprint arXiv:2511.22699},
+  year={2025}
+}
+```

teaser.jpg ADDED Viewed

Git LFS Details

SHA256: 6944f032282144ec4bba1942de2a5df01ae2f6534ad973e61939f835b7ebfdc2
Pointer size: 132 Bytes
Size of remote file: 8.98 MB

z-image-BF16.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:82eebf65df40cb09cf29094e30b3cdf7b628e873b7fe5f32023ddb6f0188490c
+size 12311939136

z-image-F16.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9e1d3fd6f88e00eda20932b860e2369dbe26b412f196dbf7cf63e4665fe61965
+size 12311939136

z-image-Q2_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9c1857629988bcd6ff3468e5b2fe6c770aa26591f984bd91eecad18f36c77e45
+size 4013115456

z-image-Q3_K_L.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0912f25ee35775138166e0ddf3f6ec9fca8562c5cb4cc2b346652f238ff1038d
+size 4604183616

z-image-Q3_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e0382d4b1affe9e552392aa9c53a20c2d661b4ddd7f8c56f1f34626c2538368c
+size 4559946816

z-image-Q3_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:04be2a299f4bb62df12fae37a3e93ebbfb0ebc899f493e3959d93b3c2de4591e
+size 4360190016

z-image-Q4_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4c5cfc02e6007ae1f0b0d690f68bda452f51f7b8ab6d9f500f8c5b829fea4377
+size 4585244736

z-image-Q4_1.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cb8afc552ab71354b5d3cf6592c14bc378857b987df324e6c638bb6ca67d2d99
+size 4850665536

z-image-Q4_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a62b929f76553b21f68894e9ed34d24b7fb67fb59b5689fa06981865986cce40
+size 5066995776

z-image-Q4_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1a4329507c97af2ac6ea81d69e89dc0a57eeb6117dcddbc6d1b3bd082f0ecde0
+size 4787443776

z-image-Q5_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b1894d2902090e7a9bce21446125a5313ab0955f722a862115fafecc02fbadbf
+size 5263542336

z-image-Q5_1.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9f8e033c14c54c4a3943c1ac67958aaf56f777fb4c3556ee77d61e172b029e2e
+size 5528963136

z-image-Q5_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:266a5e7d8e0ab2494cb202d7bd8fcfd79b2161b0678252f3e854e992844811c6
+size 5578099776

z-image-Q5_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:67b2ebf5e743a4d318644773602ea7cf0ca90bb42b6ff3254c9f13e301bfc8d0
+size 5289562176

z-image-Q6_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cbe468f6095fb3fe8089c015e6c10c6f8416fdd05407865d431fe3493bed75c0
+size 6101921856

z-image-Q8_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a51d4e36f84caa972e0eff65e8bd2961add14374d8a7038a51eefea2fe45f2c9
+size 7224707136