bubbliiiing commited on
Commit Β·
d47c43f
1
Parent(s): f92b3da
Update Weights
Browse files- .gitattributes +2 -0
- README.md +147 -0
- Z-Image-Fun-Controlnet-Union-2.1-lite.safetensors +3 -0
- Z-Image-Fun-Controlnet-Union-2.1.safetensors +3 -0
- asset/canny.jpg +3 -0
- asset/depth.png +3 -0
- asset/gray.jpg +3 -0
- asset/hed.jpg +3 -0
- asset/inpaint.jpg +3 -0
- asset/low_res.jpg +3 -0
- asset/mask.jpg +3 -0
- asset/pose.jpg +3 -0
- asset/pose2.jpg +3 -0
- asset/pose3.jpg +3 -0
- results/canny.png +3 -0
- results/depth.png +3 -0
- results/gray.png +3 -0
- results/hed.png +3 -0
- results/inpaint.png +3 -0
- results/pose.png +3 -0
- results/pose2.png +3 -0
- results/pose3.png +3 -0
- results/pose_inpaint.png +3 -0
.gitattributes
CHANGED
|
@@ -1,3 +1,5 @@
|
|
|
|
|
|
|
|
| 1 |
*.7z filter=lfs diff=lfs merge=lfs -text
|
| 2 |
*.arrow filter=lfs diff=lfs merge=lfs -text
|
| 3 |
*.bin filter=lfs diff=lfs merge=lfs -text
|
|
|
|
| 1 |
+
*.png filter=lfs diff=lfs merge=lfs -text
|
| 2 |
+
*.jpg filter=lfs diff=lfs merge=lfs -text
|
| 3 |
*.7z filter=lfs diff=lfs merge=lfs -text
|
| 4 |
*.arrow filter=lfs diff=lfs merge=lfs -text
|
| 5 |
*.bin filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
|
@@ -1,3 +1,150 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
| 4 |
+
|
| 5 |
+
# Z-Image-Fun-Controlnet-Union-2.1
|
| 6 |
+
|
| 7 |
+
[](https://github.com/aigc-apps/VideoX-Fun)
|
| 8 |
+
|
| 9 |
+
## Model Card
|
| 10 |
+
|
| 11 |
+
| Name | Description |
|
| 12 |
+
|--|--|
|
| 13 |
+
| Z-Image-Fun-Controlnet-Union-2.1.safetensors | ControlNet weights for Z-Image. The model supports multiple control conditions such as Canny, Depth, Pose, MLSD, Scribble, Hed and Gray. This ControlNet is added on 15 layer blocks and 2 refiner layer blocks. |
|
| 14 |
+
| Z-Image-Fun-Controlnet-Union-2.1-lite.safetensors | Uses the same training scheme as the 2601 version, but compared to the large version of the model, fewer layers have control added, resulting in weaker control conditions. This makes it suitable for larger control_context_scale values, and the generation results appear more natural. It is also suitable for lower-spec machines. |
|
| 15 |
+
|
| 16 |
+
## Model Features
|
| 17 |
+
- This ControlNet is added on 15 layer blocks and 2 refiner layer blocks (Lite models are added on 3 layer blocks and 2 refiner blocks). It supports multiple control conditionsβincluding Canny, Depth, Pose, MLSD, Scribble, Hed and Gray can be used like a standard ControlNet.
|
| 18 |
+
- Inpainting mode is also supported.
|
| 19 |
+
- You can adjust control_context_scale for stronger control and better detail preservation. For better stability, we highly recommend using a detailed prompt. The optimal range for control_context_scale is from 0.65 to 0.90.
|
| 20 |
+
|
| 21 |
+
## Results
|
| 22 |
+
|
| 23 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 24 |
+
<tr>
|
| 25 |
+
<td>Pose + Inpaint</td>
|
| 26 |
+
<td>Output</td>
|
| 27 |
+
</tr>
|
| 28 |
+
<tr>
|
| 29 |
+
<td><img src="asset/inpaint.jpg" width="100%" /><img src="asset/mask.jpg" width="100%" /></td>
|
| 30 |
+
<td><img src="results/inpaint.png" width="100%" /></td>
|
| 31 |
+
</tr>
|
| 32 |
+
</table>
|
| 33 |
+
|
| 34 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 35 |
+
<tr>
|
| 36 |
+
<td>Pose + Inpaint</td>
|
| 37 |
+
<td>Output</td>
|
| 38 |
+
</tr>
|
| 39 |
+
<tr>
|
| 40 |
+
<td><img src="asset/inpaint.jpg" width="100%" /><img src="asset/mask.jpg" width="100%" /><img src="asset/pose.jpg" width="100%" /></td>
|
| 41 |
+
<td><img src="results/pose_inpaint.png" width="100%" /></td>
|
| 42 |
+
</tr>
|
| 43 |
+
</table>
|
| 44 |
+
|
| 45 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 46 |
+
<tr>
|
| 47 |
+
<td>Pose</td>
|
| 48 |
+
<td>Output</td>
|
| 49 |
+
</tr>
|
| 50 |
+
<tr>
|
| 51 |
+
<td><img src="asset/pose2.jpg" width="100%" /></td>
|
| 52 |
+
<td><img src="results/pose2.png" width="100%" /></td>
|
| 53 |
+
</tr>
|
| 54 |
+
</table>
|
| 55 |
+
|
| 56 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 57 |
+
<tr>
|
| 58 |
+
<td>Pose</td>
|
| 59 |
+
<td>Output</td>
|
| 60 |
+
</tr>
|
| 61 |
+
<tr>
|
| 62 |
+
<td><img src="asset/pose.jpg" width="100%" /></td>
|
| 63 |
+
<td><img src="results/pose.png" width="100%" /></td>
|
| 64 |
+
</tr>
|
| 65 |
+
</table>
|
| 66 |
+
|
| 67 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 68 |
+
<tr>
|
| 69 |
+
<td>Pose</td>
|
| 70 |
+
<td>Output</td>
|
| 71 |
+
</tr>
|
| 72 |
+
<tr>
|
| 73 |
+
<td><img src="asset/pose3.jpg" width="100%" /></td>
|
| 74 |
+
<td><img src="results/pose3.png" width="100%" /></td>
|
| 75 |
+
</tr>
|
| 76 |
+
</table>
|
| 77 |
+
|
| 78 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 79 |
+
<tr>
|
| 80 |
+
<td>Canny</td>
|
| 81 |
+
<td>Output</td>
|
| 82 |
+
</tr>
|
| 83 |
+
<tr>
|
| 84 |
+
<td><img src="asset/canny.jpg" width="100%" /></td>
|
| 85 |
+
<td><img src="results/canny.png" width="100%" /></td>
|
| 86 |
+
</tr>
|
| 87 |
+
</table>
|
| 88 |
+
|
| 89 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 90 |
+
<tr>
|
| 91 |
+
<td>HED</td>
|
| 92 |
+
<td>Output</td>
|
| 93 |
+
</tr>
|
| 94 |
+
<tr>
|
| 95 |
+
<td><img src="asset/hed.jpg" width="100%" /></td>
|
| 96 |
+
<td><img src="results/hed.png" width="100%" /></td>
|
| 97 |
+
</tr>
|
| 98 |
+
</table>
|
| 99 |
+
|
| 100 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 101 |
+
<tr>
|
| 102 |
+
<td>Depth</td>
|
| 103 |
+
<td>Output</td>
|
| 104 |
+
</tr>
|
| 105 |
+
<tr>
|
| 106 |
+
<td><img src="asset/depth.jpg" width="100%" /></td>
|
| 107 |
+
<td><img src="results/depth.png" width="100%" /></td>
|
| 108 |
+
</tr>
|
| 109 |
+
</table>
|
| 110 |
+
|
| 111 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 112 |
+
<tr>
|
| 113 |
+
<td>Depth</td>
|
| 114 |
+
<td>Output</td>
|
| 115 |
+
</tr>
|
| 116 |
+
<tr>
|
| 117 |
+
<td><img src="asset/gray.jpg" width="100%" /></td>
|
| 118 |
+
<td><img src="results/gray.png" width="100%" /></td>
|
| 119 |
+
</tr>
|
| 120 |
+
</table>
|
| 121 |
+
|
| 122 |
+
## Inference
|
| 123 |
+
Go to the VideoX-Fun repository for more details.
|
| 124 |
+
|
| 125 |
+
Please clone the VideoX-Fun repository and create the required directories:
|
| 126 |
+
|
| 127 |
+
```sh
|
| 128 |
+
# Clone the code
|
| 129 |
+
git clone https://github.com/aigc-apps/VideoX-Fun.git
|
| 130 |
+
|
| 131 |
+
# Enter VideoX-Fun's directory
|
| 132 |
+
cd VideoX-Fun
|
| 133 |
+
|
| 134 |
+
# Create model directories
|
| 135 |
+
mkdir -p models/Diffusion_Transformer
|
| 136 |
+
mkdir -p models/Personalized_Model
|
| 137 |
+
```
|
| 138 |
+
|
| 139 |
+
Then download the weights into models/Diffusion_Transformer and models/Personalized_Model.
|
| 140 |
+
|
| 141 |
+
```
|
| 142 |
+
π¦ models/
|
| 143 |
+
βββ π Diffusion_Transformer/
|
| 144 |
+
β βββ π Z-Image/
|
| 145 |
+
βββ π Personalized_Model/
|
| 146 |
+
β βββ π¦ Z-Image-Fun-Controlnet-Union-2.1.safetensors
|
| 147 |
+
β βββ π¦ Z-Image-Fun-Controlnet-Union-2.1-lite.safetensors
|
| 148 |
+
```
|
| 149 |
+
|
| 150 |
+
Then run the file `examples/z_image_fun/predict_t2i_control_2.1.py` and `examples/z_image_fun/predict_i2i_inpaint_2.1.py`.
|
Z-Image-Fun-Controlnet-Union-2.1-lite.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3c3e2265d1c301b444e83d668a2d12416b0b6d91cd2d18bc11eec12a7fc335cb
|
| 3 |
+
size 2016627488
|
Z-Image-Fun-Controlnet-Union-2.1.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2393b0c58c52a12134f6ffd96ff9b6ea3c80bb233665fb2c3b9aebcee71ae3e4
|
| 3 |
+
size 6712485600
|
asset/canny.jpg
ADDED
|
Git LFS Details
|
asset/depth.png
ADDED
|
Git LFS Details
|
asset/gray.jpg
ADDED
|
Git LFS Details
|
asset/hed.jpg
ADDED
|
Git LFS Details
|
asset/inpaint.jpg
ADDED
|
Git LFS Details
|
asset/low_res.jpg
ADDED
|
Git LFS Details
|
asset/mask.jpg
ADDED
|
Git LFS Details
|
asset/pose.jpg
ADDED
|
Git LFS Details
|
asset/pose2.jpg
ADDED
|
Git LFS Details
|
asset/pose3.jpg
ADDED
|
Git LFS Details
|
results/canny.png
ADDED
|
Git LFS Details
|
results/depth.png
ADDED
|
Git LFS Details
|
results/gray.png
ADDED
|
Git LFS Details
|
results/hed.png
ADDED
|
Git LFS Details
|
results/inpaint.png
ADDED
|
Git LFS Details
|
results/pose.png
ADDED
|
Git LFS Details
|
results/pose2.png
ADDED
|
Git LFS Details
|
results/pose3.png
ADDED
|
Git LFS Details
|
results/pose_inpaint.png
ADDED
|
Git LFS Details
|