ABDALLALSWAITI commited on
Commit
2562af0
Β·
verified Β·
1 Parent(s): a4c719d

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,22 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ 48ecf48e-20a5-4f5f-8532-a9a8b4d838f0.png filter=lfs diff=lfs merge=lfs -text
37
+ swappy-20260101-222728.png filter=lfs diff=lfs merge=lfs -text
38
+ z-image_00164_.png filter=lfs diff=lfs merge=lfs -text
39
+ z-image_00168_.png filter=lfs diff=lfs merge=lfs -text
40
+ z-image_00172_.png filter=lfs diff=lfs merge=lfs -text
41
+ z-image_00176_.png filter=lfs diff=lfs merge=lfs -text
42
+ z-image_00182_.png filter=lfs diff=lfs merge=lfs -text
43
+ z-image_00242_.png filter=lfs diff=lfs merge=lfs -text
44
+ z-image_00252_.png filter=lfs diff=lfs merge=lfs -text
45
+ z-image_00256_.png filter=lfs diff=lfs merge=lfs -text
46
+ z-image_00260_.png filter=lfs diff=lfs merge=lfs -text
47
+ z-image_00268_.png filter=lfs diff=lfs merge=lfs -text
48
+ z-image_00272_.png filter=lfs diff=lfs merge=lfs -text
49
+ z-image_00276_.png filter=lfs diff=lfs merge=lfs -text
50
+ z-image_00286_.png filter=lfs diff=lfs merge=lfs -text
51
+ z-image_00292_.png filter=lfs diff=lfs merge=lfs -text
52
+ z-image_00296_.png filter=lfs diff=lfs merge=lfs -text
53
+ z-image_00301_.png filter=lfs diff=lfs merge=lfs -text
54
+ z-image_00306_.png filter=lfs diff=lfs merge=lfs -text
48ecf48e-20a5-4f5f-8532-a9a8b4d838f0.png ADDED

Git LFS Details

  • SHA256: 8206cccc434fd652c8f3e665963674541c58189231dabe801ef52b50cf6a8cd4
  • Pointer size: 131 Bytes
  • Size of remote file: 124 kB
README.md ADDED
@@ -0,0 +1,169 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ library_name: diffusers
4
+ tags:
5
+ - text-to-image
6
+ - lora
7
+ - diffusers
8
+ - z-image-turbo
9
+ - anime
10
+ base_model: Tongyi-MAI/Z-Image-Turbo
11
+ ---
12
+
13
+ # Z-Image Turbo LoRA Guide (Best Version)
14
+
15
+ ![Cover Image](z-image_00306_.png)
16
+
17
+ **Updated:** Jan 7, 2026
18
+ **Type:** LoRA
19
+ **Base Model:** ZImageTurbo
20
+ **Training:** Steps: 5,000 | Epochs: 42
21
+ **Trigger Word:** `aimaginedworlds`
22
+
23
+ ## The Best Result: V1 Adapter Training
24
+ **Style:** Anime / Illustration
25
+ **Base Model:** Tongyi-MAI/Z-Image-Turbo
26
+
27
+ ## πŸ“– My Story: The Road to the "Perfect" LoRA
28
+ I want to share my experience training this LoRAβ€”not just the final product, but the entire journey, because I believe transparency helps the community learn.
29
+
30
+ ### 🚫 Attempt 1: The 1000-Image Dataset + V2 Adapter
31
+ I started big. I thought more data = better results, so I gathered 1000 images and used the newest adapter:
32
+ * **Adapter:** `ostris/zimage_turbo_training_adapter V2`
33
+ * **Result:** Complete failure. The LoRA didn't capture the anime style at all. The outputs looked generic and lacked any personality from the training data.
34
+
35
+ ### ⚠️ Attempt 2: Curated 100+ Image Dataset + V2 Adapter
36
+ I realized quality beats quantity. I carefully curated a smaller dataset of ~118 high-quality anime images with detailed captions.
37
+ * **Result:** Better, but still not amazing. The V2 adapter seemed to struggle with strong style transfer. The outputs were "okay," but not the striking anime aesthetic I was aiming for.
38
+
39
+ ### πŸ”„ Attempt 3: Trying Z-Image-De-Turbo
40
+ I switched gears entirely. I thought maybe training on the non-turbo base model would give me more control:
41
+ * **Model:** `ostris/Z-Image-De-Turbo`
42
+ * **Result:** Nothing amazing. While technically capable, it didn't produce the vibrant, stylized anime look I wanted. It felt "flat."
43
+
44
+ ### βœ… Attempt 4: The V1 Adapter β€” THE WINNER!
45
+ Out of frustration, I went back to the original V1 adapter. And guess what?
46
+ * **Adapter:** `ostris/zimage_turbo_training_adapter_v1.safetensors`
47
+ * **Dataset:** My curated 118 anime images
48
+ * **Result:** **AMAZING!** This was the breakthrough. The V1 adapter, combined with the right settings, finally captured the anime style beautifully. Fast inference, strong style, and consistent quality.
49
+
50
+ *Sometimes, the "old" version just works better.*
51
+
52
+ ## πŸ’Έ The Real Cost: What This Training Cost Me
53
+ Training LoRAs isn't free. Here's the honest breakdown of what I spent on Modal cloud compute to reach this result:
54
+
55
+ * **GPU Used:** NVIDIA H200
56
+ * **Total Training Runs:** 10+
57
+ * **Total Cost:** ~$60
58
+ * **Time Invested:** Multiple days of experimentation...
59
+
60
+ That's $60 and countless hours of debugging, testing different adapters, adjusting hyperparameters, and waiting for training jobs to complete, all to find the perfect combination.
61
+
62
+ ![Training Cost](48ecf48e-20a5-4f5f-8532-a9a8b4d838f0.png)
63
+
64
+ ## βš™οΈ The Winning Configuration
65
+ Here is the exact configuration that produced the best results. Feel free to use it as a starting point for your own training!
66
+
67
+ ```yaml
68
+ job: "extension"
69
+ config:
70
+ name: "aimaginedworlds_turbo"
71
+ process:
72
+ - type: "diffusion_trainer"
73
+ training_folder: "/root/ai-toolkit/modal_output"
74
+ device: "cuda"
75
+ trigger_word: "aimaginedworlds"
76
+ network:
77
+ type: "lora"
78
+ linear: 32
79
+ linear_alpha: 32
80
+ conv: 16
81
+ conv_alpha: 16
82
+ save:
83
+ dtype: "bf16"
84
+ save_every: 250
85
+ max_step_saves_to_keep: 4
86
+ datasets:
87
+ - folder_path: "/root/ai-toolkit/training_data/aimaginedworlds"
88
+ caption_ext: "txt"
89
+ caption_dropout_rate: 0.05
90
+ resolution:
91
+ - 512
92
+ - 768
93
+ - 1024
94
+ train:
95
+ batch_size: 1
96
+ steps: 5000
97
+ gradient_checkpointing: true
98
+ noise_scheduler: "flowmatch"
99
+ optimizer: "adamw8bit"
100
+ lr: 0.0001
101
+ dtype: "bf16"
102
+ model:
103
+ name_or_path: "Tongyi-MAI/Z-Image-Turbo"
104
+ arch: "zimage:turbo"
105
+ assistant_lora_path: "ostris/zimage_turbo_training_adapter/zimage_turbo_training_adapter_v1.safetensors"
106
+ sample:
107
+ sampler: "flowmatch"
108
+ sample_every: 250
109
+ guidance_scale: 1
110
+ sample_steps: 8
111
+ ```
112
+
113
+ ### Key Settings:
114
+ * **Rank 32/Alpha 32:** The sweet spot for style without overfitting.
115
+ * **V1 Adapter:** The secret sauce!
116
+ * **5000 Steps:** Enough for full convergence.
117
+ * **FlowMatch Scheduler:** Native to Z-Image Turbo.
118
+
119
+ ## πŸš€ How to Use This LoRA
120
+ This LoRA was trained specifically for anime/illustration style. It works best when you keep prompts simple and let the trigger word do the heavy lifting.
121
+
122
+ ## 🎨 Showcase
123
+ Here are some examples of what you can create:
124
+
125
+ | | | |
126
+ |:---:|:---:|:---:|
127
+ | ![Showcase 1](z-image_00268_.png) | ![Showcase 2](z-image_00272_.png) | ![Showcase 3](z-image_00292_.png) |
128
+ | ![Showcase 4](z-image_00301_.png) | ![Showcase 5](z-image_00252_.png) | ![Showcase 6](z-image_00242_.png) |
129
+
130
+ ### ✨ The Trigger Word
131
+ Just add `aimaginedworlds` at the start of your prompt:
132
+
133
+ > `aimaginedworlds, a girl with blue hair sitting in a cafe`
134
+
135
+ That's it! You don't need complex prompting, the style is baked in.
136
+
137
+ ### πŸ”Œ Recommended: Z-Image-Turbo Prompt Template Node
138
+ For optimal results with this LoRA, use my **ComfyUI-OllamaGemini** node with the new Z-Image-Turbo prompt template:
139
+
140
+ πŸ”— **[ComfyUI-OllamaGemini](https://github.com/AbdallahAlswaiti/ComfyUI-OllamaGemini)**
141
+
142
+ It does magic prompting using Flux, Veo3.1, Qwen, Gemini, Banana Pro, Imagen4, and more!
143
+
144
+ ![Magic Prompting](swappy-20260101-222728.png)
145
+
146
+ ## ❀️ Support My Work
147
+ Creating high-quality LoRAs takes real time, effort, and money. As you saw above, this project alone cost me ~$60 in cloud compute and days of experimentation.
148
+
149
+ If this LoRA helps you create beautiful images, please consider supporting my work. Even a small contribution helps me:
150
+ * πŸ–₯️ Cover cloud compute costs for future models
151
+ * 🎨 Train more high-quality anime LoRAs
152
+ * πŸ“š Share my findings with the community
153
+
154
+ Every bit of support means the world to me and keeps me going!
155
+
156
+ ## πŸ› οΈ Tools & Credits
157
+ This LoRA was trained using the amazing **AI-Toolkit** by Ostris:
158
+ πŸ”— [https://github.com/ostris/ai-toolkit](https://github.com/ostris/ai-toolkit)
159
+
160
+ If you're interested in training your own LoRAs, I highly recommend checking it out. It's powerful, well-documented, and actively maintained!
161
+
162
+ ## πŸ™ How You Can Help
163
+ If you found this useful, here are some ways to support:
164
+ * πŸ’Έ **Support via PayPal** β€” Help cover those GPU costs!
165
+ * πŸ“’ **Share your creations** β€” Tag me so I can see what you make!
166
+
167
+ [**PayPal**](https://paypal.me/AbdallahAlswaiti)
168
+
169
+ *Made with ❀️, frustration, and a lot of GPU hours.*
aimaginedworlds_turbo.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:34d27f3c3ef36791284726d684877f445453b57089bb05d47c4b6aaf4a40b7ae
3
+ size 170128312
swappy-20260101-222728.png ADDED

Git LFS Details

  • SHA256: 2795103ffc4e883cb12989bc40a6cfb572a83953902f7049f546a2e935c6ae90
  • Pointer size: 131 Bytes
  • Size of remote file: 240 kB
z-image_00164_.png ADDED

Git LFS Details

  • SHA256: 41c13b35ed1d895237b64e3083ded1d59f60c82b891a937412e1abf649c099d8
  • Pointer size: 132 Bytes
  • Size of remote file: 2.32 MB
z-image_00168_.png ADDED

Git LFS Details

  • SHA256: af2a9c87ea4e3b05ba76ae2b7243d13c1edc30134b37f3831a2eb46ce8e74939
  • Pointer size: 132 Bytes
  • Size of remote file: 2.46 MB
z-image_00172_.png ADDED

Git LFS Details

  • SHA256: 4eb8e59162efc90a8434742869972ed4556b35c09ef7b016c895963bf456c5d8
  • Pointer size: 132 Bytes
  • Size of remote file: 1.91 MB
z-image_00176_.png ADDED

Git LFS Details

  • SHA256: 83fed4f6f14603fddf0b1244ba73ce29283838b320dfc2ef7814b1bc38ad3ae1
  • Pointer size: 132 Bytes
  • Size of remote file: 1.99 MB
z-image_00182_.png ADDED

Git LFS Details

  • SHA256: e04361e08d731001ea5502002548513aab3567b6f334026d92612c622497be63
  • Pointer size: 132 Bytes
  • Size of remote file: 2.24 MB
z-image_00242_.png ADDED

Git LFS Details

  • SHA256: 72c3bd36c7dbc534eeb7c78678aa496fdbd09abbdbe3e555065f69c72f11edd7
  • Pointer size: 132 Bytes
  • Size of remote file: 2.37 MB
z-image_00252_.png ADDED

Git LFS Details

  • SHA256: a856a0034ecc247421a63c1d314f798120238f603f36829eb773f38389eca2af
  • Pointer size: 132 Bytes
  • Size of remote file: 2.21 MB
z-image_00256_.png ADDED

Git LFS Details

  • SHA256: bcdbc24ab87a8934ffad5f8af454012beefff1046a14597dd41bbf87ceecb2ff
  • Pointer size: 132 Bytes
  • Size of remote file: 2.67 MB
z-image_00260_.png ADDED

Git LFS Details

  • SHA256: 8ec7bb2655cc4e0b925a89e3e1dfe534be2aa77a33ae290a8d07f30e3a9b8fec
  • Pointer size: 132 Bytes
  • Size of remote file: 2.44 MB
z-image_00268_.png ADDED

Git LFS Details

  • SHA256: af5eb66201e6283dbe8e23c02fb4a9bd699184e4142e7d4996bf6196a0bc5337
  • Pointer size: 132 Bytes
  • Size of remote file: 2.6 MB
z-image_00272_.png ADDED

Git LFS Details

  • SHA256: 0ea8035b08261b2c51a80027915bfb1a6ea1747ecf6239392664afcbbae363b8
  • Pointer size: 132 Bytes
  • Size of remote file: 2.27 MB
z-image_00276_.png ADDED

Git LFS Details

  • SHA256: 56704789909ba815c989cb82036d80fb7f4d4e8e7edbad791551c0193eb310f1
  • Pointer size: 132 Bytes
  • Size of remote file: 2.39 MB
z-image_00286_.png ADDED

Git LFS Details

  • SHA256: a19de5f450f58f11b002b9a118bfd051eb50d2537f38dbb374cddce921244656
  • Pointer size: 132 Bytes
  • Size of remote file: 2.48 MB
z-image_00292_.png ADDED

Git LFS Details

  • SHA256: 888e84c9ddcc7a769350d20447fe8246ae8208462492ed6b6404ee3ed64d93f3
  • Pointer size: 132 Bytes
  • Size of remote file: 2.48 MB
z-image_00296_.png ADDED

Git LFS Details

  • SHA256: ccb58073eb61edf89cb0a8dbcce3a0204bb0ca71601a816d037c3aa1df31ee48
  • Pointer size: 132 Bytes
  • Size of remote file: 2.44 MB
z-image_00301_.png ADDED

Git LFS Details

  • SHA256: a962ef85b347c2bee2649cb7c453c25689405bf8f08a26eed2061d6624b6e2ac
  • Pointer size: 132 Bytes
  • Size of remote file: 2.04 MB
z-image_00306_.png ADDED

Git LFS Details

  • SHA256: c257545af8c6b8ea6cd51e2aeb90dc8e36a9e140cf4e07f2074b189adf8464df
  • Pointer size: 131 Bytes
  • Size of remote file: 827 kB