Ayane22's picture
Create rw
2b2656b
Loading settings from /content/LoRA/config/config_file.toml...
/content/LoRA/config/config_file
prepare tokenizer
update token length: 225
Load dataset config from /content/LoRA/config/dataset_config.toml
prepare images.
found directory /content/LoRA/train_data contains 14 image files
2800 train images with repeating.
0 reg images.
no regularization images / 正則化画像が見つかりませんでした
[Dataset 0]
batch_size: 6
resolution: (512, 512)
enable_bucket: True
min_bucket_reso: 256
max_bucket_reso: 1024
bucket_reso_steps: 64
bucket_no_upscale: False
[Subset 0 of Dataset 0]
image_dir: "/content/LoRA/train_data"
image_count: 14
num_repeats: 200
shuffle_caption: True
keep_tokens: 0
caption_dropout_rate: 0
caption_dropout_every_n_epoches: 0
caption_tag_dropout_rate: 0
color_aug: False
flip_aug: False
face_crop_aug_range: None
random_crop: False
token_warmup_min: 1,
token_warmup_step: 0,
is_reg: False
class_tokens: mksks
caption_extension: .txt
[Dataset 0]
loading image sizes.
100% 14/14 [00:00<00:00, 1641.01it/s]
make buckets
number of images (including repeats) / 各bucketの画像枚数(繰り返し回数を含む)
bucket 0: resolution (256, 832), count: 400
bucket 1: resolution (448, 576), count: 800
bucket 2: resolution (512, 512), count: 800
bucket 3: resolution (704, 320), count: 800
mean ar error (without repeats): 0.0496656946415151
prepare accelerator
Using accelerator 0.15.0 or above.
loading model for process 0/1
load StableDiffusion checkpoint
loading u-net: <All keys matched successfully>
loading vae: <All keys matched successfully>
Downloading (…)lve/main/config.json: 100% 4.52k/4.52k [00:00<00:00, 3.94MB/s]
Downloading pytorch_model.bin: 100% 1.71G/1.71G [00:23<00:00, 71.5MB/s]
loading text encoder: <All keys matched successfully>
Replace CrossAttention.forward to use xformers
[Dataset 0]
caching latents.
100% 4/4 [00:08<00:00, 2.15s/it]
import network module: lycoris.kohya
Using rank adaptation algo: lora
Use Dropout value: 0.0
Create LyCORIS Module
create LyCORIS for Text Encoder: 72 modules.
Create LyCORIS Module
create LyCORIS for U-Net: 278 modules.
enable LyCORIS for text encoder
enable LyCORIS for U-Net
prepare optimizer, data loader etc.
Deprecated: use prepare_optimizer_params(text_encoder_lr, unet_lr, learning_rate) instead of prepare_optimizer_params(text_encoder_lr, unet_lr)
CUDA SETUP: CUDA runtime path found: /usr/local/cuda-11.8/targets/x86_64-linux/lib/libcudart.so
CUDA SETUP: Highest compute capability among GPUs detected: 7.5
CUDA SETUP: Detected CUDA version 118
CUDA SETUP: Loading binary /usr/local/lib/python3.10/dist-packages/bitsandbytes/libbitsandbytes_cuda118.so...
use 8-bit AdamW optimizer | {}
override steps. steps for 2 epochs is / 指定エポックまでのステップ数: 938
running training / 学習開始
num train images * repeats / 学習画像の数×繰り返し回数: 2800
num reg images / 正則化画像の数: 0
num batches per epoch / 1epochのバッチ数: 469
num epochs / epoch数: 2
batch size per device / バッチサイズ: 6
gradient accumulation steps / 勾配を合計するステップ数 = 1
total optimization steps / 学習ステップ数: 938
steps: 0% 0/938 [00:00<?, ?it/s]epoch 1/2
steps: 50% 469/938 [11:36<11:36, 1.48s/it, loss=0.0825]saving checkpoint: /content/drive/MyDrive/LoRA/output/megum_AnyLoRA_LocLys-000001.safetensors
generating sample images at step / サンプル画像生成 ステップ: 469
prompt: (masterpiece, best quality, highres), 1 girl, solo, 20 years drunk japanese girl, big cute green eyes:1.3, cute sharp face, two side up:1.4, long pink hair, hair ribon, bangs, white shirt, brown plaid skirt, blue jacket around waist
negative_prompt: lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry
height: 768
width: 512
sample_steps: 28
scale: 7.0
0% 0/28 [00:00<?, ?it/s]
4% 1/28 [00:00<00:13, 1.95it/s]
7% 2/28 [00:00<00:10, 2.49it/s]
11% 3/28 [00:01<00:09, 2.70it/s]
14% 4/28 [00:01<00:08, 2.79it/s]
18% 5/28 [00:01<00:08, 2.86it/s]
21% 6/28 [00:02<00:07, 2.91it/s]
25% 7/28 [00:02<00:07, 2.94it/s]
29% 8/28 [00:02<00:06, 2.96it/s]
32% 9/28 [00:03<00:06, 2.97it/s]
36% 10/28 [00:03<00:06, 2.97it/s]
39% 11/28 [00:03<00:05, 2.98it/s]
43% 12/28 [00:04<00:05, 2.99it/s]
46% 13/28 [00:04<00:05, 2.99it/s]
50% 14/28 [00:04<00:04, 2.99it/s]
54% 15/28 [00:05<00:04, 2.99it/s]
57% 16/28 [00:05<00:04, 2.99it/s]
61% 17/28 [00:05<00:03, 2.99it/s]
64% 18/28 [00:06<00:03, 2.99it/s]
68% 19/28 [00:06<00:03, 2.99it/s]
71% 20/28 [00:06<00:02, 2.99it/s]
75% 21/28 [00:07<00:02, 2.99it/s]
79% 22/28 [00:07<00:02, 2.99it/s]
82% 23/28 [00:07<00:01, 2.99it/s]
86% 24/28 [00:08<00:01, 2.99it/s]
89% 25/28 [00:08<00:01, 2.99it/s]
93% 26/28 [00:08<00:00, 2.99it/s]
96% 27/28 [00:09<00:00, 3.00it/s]
100% 28/28 [00:09<00:00, 2.94it/s]
epoch 2/2
steps: 100% 938/938 [23:22<00:00, 1.50s/it, loss=0.0495]generating sample images at step / サンプル画像生成 ステップ: 938
prompt: (masterpiece, best quality, highres), 1 girl, solo, 20 years drunk japanese girl, big cute green eyes:1.3, cute sharp face, two side up:1.4, long pink hair, hair ribon, bangs, white shirt, brown plaid skirt, blue jacket around waist
negative_prompt: lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry
height: 768
width: 512
sample_steps: 28
scale: 7.0
0% 0/28 [00:00<?, ?it/s]
4% 1/28 [00:00<00:08, 3.05it/s]
7% 2/28 [00:00<00:08, 3.01it/s]
11% 3/28 [00:01<00:08, 2.96it/s]
14% 4/28 [00:01<00:08, 2.98it/s]
18% 5/28 [00:01<00:07, 2.99it/s]
21% 6/28 [00:02<00:07, 2.98it/s]
25% 7/28 [00:02<00:07, 2.98it/s]
29% 8/28 [00:02<00:06, 2.99it/s]
32% 9/28 [00:03<00:06, 2.99it/s]
36% 10/28 [00:03<00:06, 2.99it/s]
39% 11/28 [00:03<00:05, 2.99it/s]
43% 12/28 [00:04<00:05, 2.99it/s]
46% 13/28 [00:04<00:05, 2.99it/s]
50% 14/28 [00:04<00:04, 2.99it/s]
54% 15/28 [00:05<00:04, 2.99it/s]
57% 16/28 [00:05<00:04, 2.99it/s]
61% 17/28 [00:05<00:03, 2.99it/s]
64% 18/28 [00:06<00:03, 3.00it/s]
68% 19/28 [00:06<00:03, 2.99it/s]
71% 20/28 [00:06<00:02, 2.99it/s]
75% 21/28 [00:07<00:02, 3.00it/s]
79% 22/28 [00:07<00:02, 3.00it/s]
82% 23/28 [00:07<00:01, 2.99it/s]
86% 24/28 [00:08<00:01, 3.00it/s]
89% 25/28 [00:08<00:01, 2.99it/s]
93% 26/28 [00:08<00:00, 2.99it/s]
96% 27/28 [00:09<00:00, 2.99it/s]
100% 28/28 [00:09<00:00, 2.99it/s]
save trained model to /content/drive/MyDrive/LoRA/output/megum_AnyLoRA_LocLys.safetensors
model saved.
steps: 100% 938/938 [23:34<00:00, 1.51s/it, loss=0.0495]