Create rw

2b2656b almost 3 years ago

6.93 kB

	Loading settings from /content/LoRA/config/config_file.toml...
	/content/LoRA/config/config_file
	prepare tokenizer
	update token length: 225
	Load dataset config from /content/LoRA/config/dataset_config.toml
	prepare images.
	found directory /content/LoRA/train_data contains 14 image files
	2800 train images with repeating.
	0 reg images.
	no regularization images / 正則化画像が見つかりませんでした
	[Dataset 0]
	batch_size: 6
	resolution: (512, 512)
	enable_bucket: True
	min_bucket_reso: 256
	max_bucket_reso: 1024
	bucket_reso_steps: 64
	bucket_no_upscale: False

	[Subset 0 of Dataset 0]
	image_dir: "/content/LoRA/train_data"
	image_count: 14
	num_repeats: 200
	shuffle_caption: True
	keep_tokens: 0
	caption_dropout_rate: 0
	caption_dropout_every_n_epoches: 0
	caption_tag_dropout_rate: 0
	color_aug: False
	flip_aug: False
	face_crop_aug_range: None
	random_crop: False
	token_warmup_min: 1,
	token_warmup_step: 0,
	is_reg: False
	class_tokens: mksks
	caption_extension: .txt


	[Dataset 0]
	loading image sizes.
	100% 14/14 [00:00<00:00, 1641.01it/s]
	make buckets
	number of images (including repeats) / 各bucketの画像枚数（繰り返し回数を含む）
	bucket 0: resolution (256, 832), count: 400
	bucket 1: resolution (448, 576), count: 800
	bucket 2: resolution (512, 512), count: 800
	bucket 3: resolution (704, 320), count: 800
	mean ar error (without repeats): 0.0496656946415151
	prepare accelerator
	Using accelerator 0.15.0 or above.
	loading model for process 0/1
	load StableDiffusion checkpoint
	loading u-net: <All keys matched successfully>
	loading vae: <All keys matched successfully>
	Downloading (…)lve/main/config.json: 100% 4.52k/4.52k [00:00<00:00, 3.94MB/s]
	Downloading pytorch_model.bin: 100% 1.71G/1.71G [00:23<00:00, 71.5MB/s]
	loading text encoder: <All keys matched successfully>
	Replace CrossAttention.forward to use xformers
	[Dataset 0]
	caching latents.
	100% 4/4 [00:08<00:00, 2.15s/it]
	import network module: lycoris.kohya
	Using rank adaptation algo: lora
	Use Dropout value: 0.0
	Create LyCORIS Module
	create LyCORIS for Text Encoder: 72 modules.
	Create LyCORIS Module
	create LyCORIS for U-Net: 278 modules.
	enable LyCORIS for text encoder
	enable LyCORIS for U-Net
	prepare optimizer, data loader etc.
	Deprecated: use prepare_optimizer_params(text_encoder_lr, unet_lr, learning_rate) instead of prepare_optimizer_params(text_encoder_lr, unet_lr)
	CUDA SETUP: CUDA runtime path found: /usr/local/cuda-11.8/targets/x86_64-linux/lib/libcudart.so
	CUDA SETUP: Highest compute capability among GPUs detected: 7.5
	CUDA SETUP: Detected CUDA version 118
	CUDA SETUP: Loading binary /usr/local/lib/python3.10/dist-packages/bitsandbytes/libbitsandbytes_cuda118.so...
	use 8-bit AdamW optimizer \| {}
	override steps. steps for 2 epochs is / 指定エポックまでのステップ数: 938
	running training / 学習開始
	num train images * repeats / 学習画像の数×繰り返し回数: 2800
	num reg images / 正則化画像の数: 0
	num batches per epoch / 1epochのバッチ数: 469
	num epochs / epoch数: 2
	batch size per device / バッチサイズ: 6
	gradient accumulation steps / 勾配を合計するステップ数 = 1
	total optimization steps / 学習ステップ数: 938
	steps: 0% 0/938 [00:00<?, ?it/s]epoch 1/2
	steps: 50% 469/938 [11:36<11:36, 1.48s/it, loss=0.0825]saving checkpoint: /content/drive/MyDrive/LoRA/output/megum_AnyLoRA_LocLys-000001.safetensors
	generating sample images at step / サンプル画像生成ステップ: 469
	prompt: (masterpiece, best quality, highres), 1 girl, solo, 20 years drunk japanese girl, big cute green eyes:1.3, cute sharp face, two side up:1.4, long pink hair, hair ribon, bangs, white shirt, brown plaid skirt, blue jacket around waist
	negative_prompt: lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry
	height: 768
	width: 512
	sample_steps: 28
	scale: 7.0

	0% 0/28 [00:00<?, ?it/s]
	4% 1/28 [00:00<00:13, 1.95it/s]
	7% 2/28 [00:00<00:10, 2.49it/s]
	11% 3/28 [00:01<00:09, 2.70it/s]
	14% 4/28 [00:01<00:08, 2.79it/s]
	18% 5/28 [00:01<00:08, 2.86it/s]
	21% 6/28 [00:02<00:07, 2.91it/s]
	25% 7/28 [00:02<00:07, 2.94it/s]
	29% 8/28 [00:02<00:06, 2.96it/s]
	32% 9/28 [00:03<00:06, 2.97it/s]
	36% 10/28 [00:03<00:06, 2.97it/s]
	39% 11/28 [00:03<00:05, 2.98it/s]
	43% 12/28 [00:04<00:05, 2.99it/s]
	46% 13/28 [00:04<00:05, 2.99it/s]
	50% 14/28 [00:04<00:04, 2.99it/s]
	54% 15/28 [00:05<00:04, 2.99it/s]
	57% 16/28 [00:05<00:04, 2.99it/s]
	61% 17/28 [00:05<00:03, 2.99it/s]
	64% 18/28 [00:06<00:03, 2.99it/s]
	68% 19/28 [00:06<00:03, 2.99it/s]
	71% 20/28 [00:06<00:02, 2.99it/s]
	75% 21/28 [00:07<00:02, 2.99it/s]
	79% 22/28 [00:07<00:02, 2.99it/s]
	82% 23/28 [00:07<00:01, 2.99it/s]
	86% 24/28 [00:08<00:01, 2.99it/s]
	89% 25/28 [00:08<00:01, 2.99it/s]
	93% 26/28 [00:08<00:00, 2.99it/s]
	96% 27/28 [00:09<00:00, 3.00it/s]
	100% 28/28 [00:09<00:00, 2.94it/s]
	epoch 2/2
	steps: 100% 938/938 [23:22<00:00, 1.50s/it, loss=0.0495]generating sample images at step / サンプル画像生成ステップ: 938
	prompt: (masterpiece, best quality, highres), 1 girl, solo, 20 years drunk japanese girl, big cute green eyes:1.3, cute sharp face, two side up:1.4, long pink hair, hair ribon, bangs, white shirt, brown plaid skirt, blue jacket around waist
	negative_prompt: lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry
	height: 768
	width: 512
	sample_steps: 28
	scale: 7.0

	0% 0/28 [00:00<?, ?it/s]
	4% 1/28 [00:00<00:08, 3.05it/s]
	7% 2/28 [00:00<00:08, 3.01it/s]
	11% 3/28 [00:01<00:08, 2.96it/s]
	14% 4/28 [00:01<00:08, 2.98it/s]
	18% 5/28 [00:01<00:07, 2.99it/s]
	21% 6/28 [00:02<00:07, 2.98it/s]
	25% 7/28 [00:02<00:07, 2.98it/s]
	29% 8/28 [00:02<00:06, 2.99it/s]
	32% 9/28 [00:03<00:06, 2.99it/s]
	36% 10/28 [00:03<00:06, 2.99it/s]
	39% 11/28 [00:03<00:05, 2.99it/s]
	43% 12/28 [00:04<00:05, 2.99it/s]
	46% 13/28 [00:04<00:05, 2.99it/s]
	50% 14/28 [00:04<00:04, 2.99it/s]
	54% 15/28 [00:05<00:04, 2.99it/s]
	57% 16/28 [00:05<00:04, 2.99it/s]
	61% 17/28 [00:05<00:03, 2.99it/s]
	64% 18/28 [00:06<00:03, 3.00it/s]
	68% 19/28 [00:06<00:03, 2.99it/s]
	71% 20/28 [00:06<00:02, 2.99it/s]
	75% 21/28 [00:07<00:02, 3.00it/s]
	79% 22/28 [00:07<00:02, 3.00it/s]
	82% 23/28 [00:07<00:01, 2.99it/s]
	86% 24/28 [00:08<00:01, 3.00it/s]
	89% 25/28 [00:08<00:01, 2.99it/s]
	93% 26/28 [00:08<00:00, 2.99it/s]
	96% 27/28 [00:09<00:00, 2.99it/s]
	100% 28/28 [00:09<00:00, 2.99it/s]
	save trained model to /content/drive/MyDrive/LoRA/output/megum_AnyLoRA_LocLys.safetensors
	model saved.
	steps: 100% 938/938 [23:34<00:00, 1.51s/it, loss=0.0495]