Kokoboy commited on
Commit
c7729df
·
1 Parent(s): 00835cb

Initial release: V1, V2, and V3 models

Browse files
README.md CHANGED
@@ -1,3 +1,126 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ license_name: apache-2.0
4
+ tags:
5
+ - lora
6
+ - manga
7
+ - coloring
8
+ - anime
9
+ - qwen
10
+ - dataset
11
+ - diffusers
12
+ - image-to-image
13
+ viewer: false
14
  ---
15
+
16
+ # PanelPainter-Project
17
+
18
+ **PanelPainter-Project** is the central repository for the PanelPainter manga coloring LoRAs.
19
+
20
+ This project is dedicated to training LoRAs to automate the coloring of black-and-white manga panels. I am releasing all the files here, including datasets, logs, and experimental versions, so others can see exactly how it was trained.
21
+
22
+ ## Project Structure
23
+
24
+ This repository contains everything used to create the models:
25
+
26
+ ### 1. LoRA Models (`/loras`)
27
+ This directory contains the model weights for all iterations of the project:
28
+
29
+ * **V3 (Latest Release):** `PanelPainter_v3_Qwen2511.safetensors`
30
+ * **Base:** Qwen Image Edit 2511
31
+ * **Note:** The latest model trained on the expanded 903-image dataset.
32
+ * **V2 (Stable):** `PanelPainter_v2_Qwen2509.safetensors`
33
+ * **Base:** Qwen Image Edit 2509 (Compatible with 2511).
34
+ * **Note:** Standard release (High quality, low variety).
35
+ * **V1 (Legacy):** `PanelPainter_v1_Legacy.safetensors`
36
+ * **Base:** Qwen Image Edit 2509
37
+ * **Note:** Archived experimental version (synthetic data).
38
+
39
+ ### 2. Training Logs (`/logs`)
40
+ **Content:** Tensorboard logs and charts from my training runs. You can check these to see how the loss converged and how the model learned over time for each version.
41
+
42
+ ### 3. Training Dataset
43
+ The datasets used for this project are hosted separately:
44
+
45
+ * **PanelPainter-Dataset**
46
+ * Contains the curated image pairs used for training the active versions.
47
+
48
+ ---
49
+
50
+ ## Version History & Development Log
51
+
52
+ ### Version 3.0 (Current Release)
53
+ * **Status:** Released.
54
+ * **Base Architecture:** Qwen 2511.
55
+ * **Strategy:** Scaling Up High-Quality Data.
56
+ * **Dataset:** Expanded to 903 images.
57
+ * **Summary:** This version combines the correct "real line art" training method discovered in V2 with a significantly larger dataset. This improves the model's ability to generalize across different manga styles while maintaining the color quality of V2.
58
+
59
+ ### Version 2.0
60
+ * **Status:** Released / Stable.
61
+ * **Base Model:** Trained on Qwen Image Edit 2509, also it works on Qwen 2511 as well.
62
+ * **The Breakthrough:** After V1 failed, this version switched to training on real line art instead of synthetic grayscale.
63
+ * **Dataset:** A tiny, hyper-curated set of 150 images (70% Doujin / 30% SFW).
64
+ * **Outcome:** Despite the small size, it proved that high-quality real line art outperforms massive synthetic datasets. It produces good colors but lacks variety due to the small sample size.
65
+
66
+ ### Version 1.0
67
+ * **Status:** Archived / Deprecated.
68
+ * **Base Model:** Qwen Image Edit 2509.
69
+ * **The Mistake:** Trained on 7,000 images generated by simply desaturating colored pages (synthetic grayscale).
70
+ * **Outcome:** The model learned to color "perfect gray" inputs but failed on real, imperfect ink lines.
71
+ * **Lesson:** Quantity does not matter if the data distribution doesn't match real usage.
72
+
73
+ ---
74
+
75
+ ## Training Configuration (V3)
76
+
77
+ **Hardware:** Trained on an A40 GPU on Runpod for approximately two days.
78
+
79
+ Below is the exact accelerate command used to train the V3 model on Musubi Tuner:
80
+
81
+ ```bash
82
+ accelerate launch --num_cpu_threads_per_process 1 --mixed_precision bf16 \
83
+ /workspace/musubi-tuner/src/musubi_tuner/qwen_image_train_network.py \
84
+ --dataset_config dataset_edit.toml \
85
+ --dit /workspace/Training_Models_Qwen/Qwen_Image_Edit_2511_BF16.safetensors \
86
+ --vae /workspace/Training_Models_Qwen/qwen_train_vae.safetensors \
87
+ --text_encoder /workspace/Training_Models_Qwen/qwen_2.5_vl_7b_bf16.safetensors \
88
+ --model_version edit-2511 \
89
+ --network_module networks.lora_qwen_image \
90
+ --output_dir /workspace/output_panelpainter \
91
+ --output_name panelpainter_v3_part1 \
92
+ --mixed_precision bf16 \
93
+ --max_data_loader_n_workers 0 \
94
+ --learning_rate 3e-4 \
95
+ --network_dim 128 \
96
+ --network_alpha 128 \
97
+ --optimizer_type adafactor \
98
+ --optimizer_args "scale_parameter=False" "relative_step=False" "warmup_init=False" "weight_decay=0.01" \
99
+ --lr_scheduler cosine \
100
+ --lr_warmup_steps 150 \
101
+ --timestep_sampling qinglong_qwen \
102
+ --discrete_flow_shift 2.2 \
103
+ --max_train_epochs 8 \
104
+ --save_every_n_epochs 1 \
105
+ --save_state \
106
+ --gradient_checkpointing \
107
+ --gradient_checkpointing_cpu_offload \
108
+ --gradient_accumulation_steps 4 \
109
+ --blocks_to_swap 20 \
110
+ --sdpa
111
+ ```
112
+
113
+ ## Data Privacy & License
114
+
115
+ * **Project License:** Apache 2.0
116
+ * **Dataset Disclaimer:** The /dataset folder contains copyrighted manga panels.
117
+ * **For Learning Only:** I am sharing this strictly to show how the model was trained.
118
+ * **Copyright:** The original art belongs to the creators/publishers.
119
+ * **No Selling:** Please do not sell or repackage these images.
120
+
121
+ ## Acknowledgements
122
+
123
+ Trained on Musubi Tuner. Thanks to kohya-ss.
124
+
125
+ ## External Links
126
+ * **Public Model Page:** [Civitai: PanelPainter](https://civitai.com/models/2103847/panelpainter-manga-coloring)
loras/v1/PanelPainter_v1_Legacy.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:42974fc604542faf47cf55aea4da2ae97c8d51a2aec7068553df362486ed4123
3
+ size 295241616
loras/v2/PanelPainter_v2_Qwen2509.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:139c996de30e0a617573b5cc25a0d5a0c974ad7258635b3ae9b59593b9303e66
3
+ size 2359632080