jodie / rd
Ayane22's picture
Create rd
e2d4691
Loading settings from /content/LoRA/config/config_file.toml...
/content/LoRA/config/config_file
prepare tokenizer
Downloading (…)olve/main/vocab.json: 100% 961k/961k [00:00<00:00, 1.13MB/s]
Downloading (…)olve/main/merges.txt: 100% 525k/525k [00:00<00:00, 823kB/s]
Downloading (…)cial_tokens_map.json: 100% 389/389 [00:00<00:00, 257kB/s]
Downloading (…)okenizer_config.json: 100% 905/905 [00:00<00:00, 399kB/s]
update token length: 225
Load dataset config from /content/LoRA/config/dataset_config.toml
prepare images.
found directory /content/LoRA/train_data contains 11 image files
2750 train images with repeating.
0 reg images.
no regularization images / 正則化画像が見つかりませんでした
[Dataset 0]
batch_size: 6
resolution: (512, 512)
enable_bucket: True
min_bucket_reso: 256
max_bucket_reso: 1024
bucket_reso_steps: 64
bucket_no_upscale: False
[Subset 0 of Dataset 0]
image_dir: "/content/LoRA/train_data"
image_count: 11
num_repeats: 250
shuffle_caption: True
keep_tokens: 0
caption_dropout_rate: 0
caption_dropout_every_n_epoches: 0
caption_tag_dropout_rate: 0
color_aug: False
flip_aug: False
face_crop_aug_range: None
random_crop: False
token_warmup_min: 1,
token_warmup_step: 0,
is_reg: False
class_tokens: mksks
caption_extension: .txt
[Dataset 0]
loading image sizes.
100% 11/11 [00:00<00:00, 439.09it/s]
make buckets
number of images (including repeats) / 各bucketの画像枚数(繰り返し回数を含む)
bucket 0: resolution (384, 640), count: 250
bucket 1: resolution (512, 512), count: 1000
bucket 2: resolution (576, 448), count: 500
bucket 3: resolution (640, 384), count: 1000
mean ar error (without repeats): 0.0969292682863697
prepare accelerator
Using accelerator 0.15.0 or above.
loading model for process 0/1
load StableDiffusion checkpoint
loading u-net: <All keys matched successfully>
loading vae: <All keys matched successfully>
Downloading (…)lve/main/config.json: 100% 4.52k/4.52k [00:00<00:00, 3.13MB/s]
Downloading pytorch_model.bin: 100% 1.71G/1.71G [00:23<00:00, 73.3MB/s]
loading text encoder: <All keys matched successfully>
Replace CrossAttention.forward to use xformers
[Dataset 0]
caching latents.
100% 5/5 [00:13<00:00, 2.62s/it]
import network module: lycoris.kohya
Using rank adaptation algo: lora
Use Dropout value: 0.0
Create LyCORIS Module
create LyCORIS for Text Encoder: 72 modules.
Create LyCORIS Module
create LyCORIS for U-Net: 278 modules.
enable LyCORIS for text encoder
enable LyCORIS for U-Net
prepare optimizer, data loader etc.
Deprecated: use prepare_optimizer_params(text_encoder_lr, unet_lr, learning_rate) instead of prepare_optimizer_params(text_encoder_lr, unet_lr)
CUDA SETUP: CUDA runtime path found: /usr/local/cuda-11.8/targets/x86_64-linux/lib/libcudart.so
CUDA SETUP: Highest compute capability among GPUs detected: 7.5
CUDA SETUP: Detected CUDA version 118
CUDA SETUP: Loading binary /usr/local/lib/python3.10/dist-packages/bitsandbytes/libbitsandbytes_cuda118.so...
use 8-bit AdamW optimizer | {}
override steps. steps for 2 epochs is / 指定エポックまでのステップ数: 920
running training / 学習開始
num train images * repeats / 学習画像の数×繰り返し回数: 2750
num reg images / 正則化画像の数: 0
num batches per epoch / 1epochのバッチ数: 460
num epochs / epoch数: 2
batch size per device / バッチサイズ: 6
gradient accumulation steps / 勾配を合計するステップ数 = 1
total optimization steps / 学習ステップ数: 920
steps: 0% 0/920 [00:00<?, ?it/s]epoch 1/2
steps: 50% 460/920 [11:32<11:32, 1.51s/it, loss=0.0465]saving checkpoint: /content/drive/MyDrive/LoRA/output/last-000001.safetensors
generating sample images at step / サンプル画像生成 ステップ: 460
prompt: (masterpiece, best quality, highres, illustration, perfect anatomy), 1girl, solo, 40 years old mature female, sharp mature face, lipstick, red lips, beautiful pupil, sexy blue eyes:1.2, highlights in eyes:1.4, short hair, parted bangs, blonde hair, glasses, long nose, glamorous proportion, business suit
negative_prompt: lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry
height: 768
width: 512
sample_steps: 28
scale: 7.0
0% 0/28 [00:00<?, ?it/s]
4% 1/28 [00:00<00:15, 1.73it/s]
7% 2/28 [00:00<00:11, 2.35it/s]
11% 3/28 [00:01<00:09, 2.63it/s]
14% 4/28 [00:01<00:08, 2.77it/s]
18% 5/28 [00:01<00:07, 2.88it/s]
21% 6/28 [00:02<00:07, 2.95it/s]
25% 7/28 [00:02<00:07, 2.97it/s]
29% 8/28 [00:02<00:06, 3.00it/s]
32% 9/28 [00:03<00:06, 3.02it/s]
36% 10/28 [00:03<00:05, 3.03it/s]
39% 11/28 [00:03<00:05, 3.04it/s]
43% 12/28 [00:04<00:05, 3.05it/s]
46% 13/28 [00:04<00:04, 3.05it/s]
50% 14/28 [00:04<00:04, 3.05it/s]
54% 15/28 [00:05<00:04, 3.05it/s]
57% 16/28 [00:05<00:03, 3.05it/s]
61% 17/28 [00:05<00:03, 3.05it/s]
64% 18/28 [00:06<00:03, 3.06it/s]
68% 19/28 [00:06<00:02, 3.05it/s]
71% 20/28 [00:06<00:02, 3.05it/s]
75% 21/28 [00:07<00:02, 3.05it/s]
79% 22/28 [00:07<00:01, 3.05it/s]
82% 23/28 [00:07<00:01, 3.05it/s]
86% 24/28 [00:08<00:01, 3.05it/s]
89% 25/28 [00:08<00:00, 3.05it/s]
93% 26/28 [00:08<00:00, 3.06it/s]
96% 27/28 [00:09<00:00, 3.06it/s]
100% 28/28 [00:09<00:00, 2.98it/s]
epoch 2/2
steps: 100% 920/920 [23:20<00:00, 1.52s/it, loss=0.0274]generating sample images at step / サンプル画像生成 ステップ: 920
prompt: (masterpiece, best quality, highres, illustration, perfect anatomy), 1girl, solo, 40 years old mature female, sharp mature face, lipstick, red lips, beautiful pupil, sexy blue eyes:1.2, highlights in eyes:1.4, short hair, parted bangs, blonde hair, glasses, long nose, glamorous proportion, business suit
negative_prompt: lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry
height: 768
width: 512
sample_steps: 28
scale: 7.0
0% 0/28 [00:00<?, ?it/s]
4% 1/28 [00:00<00:08, 3.11it/s]
7% 2/28 [00:00<00:08, 3.10it/s]
11% 3/28 [00:00<00:08, 3.01it/s]
14% 4/28 [00:01<00:07, 3.05it/s]
18% 5/28 [00:01<00:07, 3.07it/s]
21% 6/28 [00:01<00:07, 3.04it/s]
25% 7/28 [00:02<00:06, 3.04it/s]
29% 8/28 [00:02<00:06, 3.06it/s]
32% 9/28 [00:02<00:06, 3.05it/s]
36% 10/28 [00:03<00:05, 3.05it/s]
39% 11/28 [00:03<00:05, 3.06it/s]
43% 12/28 [00:03<00:05, 3.06it/s]
46% 13/28 [00:04<00:04, 3.06it/s]
50% 14/28 [00:04<00:04, 3.07it/s]
54% 15/28 [00:04<00:04, 3.06it/s]
57% 16/28 [00:05<00:03, 3.06it/s]
61% 17/28 [00:05<00:03, 3.06it/s]
64% 18/28 [00:05<00:03, 3.06it/s]
68% 19/28 [00:06<00:02, 3.05it/s]
71% 20/28 [00:06<00:02, 3.06it/s]
75% 21/28 [00:06<00:02, 3.06it/s]
79% 22/28 [00:07<00:01, 3.05it/s]
82% 23/28 [00:07<00:01, 3.05it/s]
86% 24/28 [00:07<00:01, 3.05it/s]
89% 25/28 [00:08<00:00, 3.05it/s]
93% 26/28 [00:08<00:00, 3.05it/s]
96% 27/28 [00:08<00:00, 3.05it/s]
100% 28/28 [00:09<00:00, 3.05it/s]
save trained model to /content/drive/MyDrive/LoRA/output/last.safetensors
model saved.
steps: 100% 920/920 [23:31<00:00, 1.53s/it, loss=0.0274]