Bernard Maltais commited on
Commit
65c5428
·
1 Parent(s): 2d550de

1st commit

Browse files
.gitattributes CHANGED
@@ -32,3 +32,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
32
  *.zip filter=lfs diff=lfs merge=lfs -text
33
  *.zst filter=lfs diff=lfs merge=lfs -text
34
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
32
  *.zip filter=lfs diff=lfs merge=lfs -text
33
  *.zst filter=lfs diff=lfs merge=lfs -text
34
  *tfevents* filter=lfs diff=lfs merge=lfs -text
35
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,52 @@
1
  ---
2
  license: unknown
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: unknown
3
  ---
4
+ # The White Queen Dreambooth Model
5
+
6
+ The tweo models included were trained to compare a novel approach that I call "multi-resolution" dreambooth
7
+
8
+ I have combined instance images with two resolution:
9
+
10
+ - 512x512
11
+ - 640x448
12
+
13
+ One model was produced using the multi-subject feature of this dreambooth fork: https://github.com/ShivamShrirao/diffusers
14
+
15
+ That fork was slightly modified to allow it to handle non square resolutions and to allow various subject instances to use different sizes.
16
+
17
+ The other one was produced by creating two seperate models (one for each different instance resolution) and them merging them according to the number of instance images in each.
18
+
19
+ Based on preliminary testing it appear that the multi-subject approach provides the best results.
20
+
21
+ Here is the shivam configuration used for the multi-subject training:
22
+
23
+ ```
24
+ # multi-subject for different ratio processing
25
+
26
+ accelerate launch --num_cpu_threads_per_process 6 train_dreambooth.py `
27
+ --pretrained_model_name_or_path="runwayml/stable-diffusion-v1-5" `
28
+ --pretrained_vae_name_or_path="stabilityai/sd-vae-ft-mse" `
29
+ --with_prior_preservation --prior_loss_weight=1 `
30
+ --seed=494481440 `
31
+ --resolution=448 `
32
+ --train_batch_size=4 `
33
+ --train_text_encoder `
34
+ --mixed_precision="fp16" `
35
+ --revision="fp16" `
36
+ --use_8bit_adam `
37
+ --gradient_checkpointing `
38
+ --gradient_accumulation_steps=1 `
39
+ --learning_rate=1e-6 `
40
+ --lr_scheduler="constant" `
41
+ --lr_warmup_steps=0 `
42
+ --num_class_images=256 `
43
+ --sample_batch_size=4 `
44
+ --max_train_steps=1500 `
45
+ --save_interval=400 `
46
+ --save_sample_prompt="twq woman" `
47
+ --concepts_list="concepts_list_twq.json" `
48
+ --output_dir="./twq/output-multi" `
49
+ --n_save_sample=4 `
50
+ --save_min_steps=400 `
51
+ --pad_tokens
52
+ ```
twq_woman-shiv-1200-multires-merge.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:259d11d6c8122253de519b598c1f1b16dec2e45d0d07924308e7660cb56babd2
3
+ size 2132856622
twq_woman-shiv-1200-multires.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2d4f02ebe3c940e535c8ae248472daeceec3815e5f88782a060247286298d239
3
+ size 2132856622