suzushi commited on
Commit
5dcb44b
·
verified ·
1 Parent(s): 3d5fc25

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +48 -0
README.md ADDED
@@ -0,0 +1,48 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license_name: stabilityai-ai-community
5
+ license_link: LICENSE.md
6
+ library_name: diffusers
7
+ pipeline_tag: text-to-image
8
+ tags:
9
+ - text-to-image
10
+ base_model:
11
+ - suzushi/miso-diffusion-m-1.0
12
+ - stabilityai/stable-diffusion-3.5-medium
13
+ ---
14
+
15
+ <div style="display: flex; justify-content: center; gap: 20px; margin-bottom: 20px;">
16
+ <img src="demo1.png" width="400" />
17
+ <img src="demo2.png" width="400" />
18
+ </div>
19
+ # Anime SD3.5 medium Model
20
+ An attempt to fine tune sd3.5 medium
21
+
22
+ ## Version History
23
+
24
+ | Version | Base Training | Aesthetic Training | Total Epochs |
25
+ |---------|--------------|-------------------|--------------|
26
+ | alpha | 250K images | 0 images | 1 |
27
+ | beta | 160K images | 0 images | 3 |
28
+ | 1.0 | 600k images | 0 images | 2 + (3 from beta) |
29
+ | 1.1 | 710k images | 0 images | 5 |
30
+
31
+ ## Training Methodology
32
+
33
+ Training is done on gh200 with 96gb vram
34
+
35
+ Training setting: Adafactor with a batchsize of 40, lr_scheduler: cosine
36
+ SD3.5 Specific setting:
37
+ enable_scaled_pos_embed = true
38
+
39
+ pos_emb_random_crop_rate = 0.2
40
+
41
+ weighting_scheme = "flow"
42
+ learning_rate = 3.5e-6
43
+
44
+ learning_rate_te1 = 2.5e-6
45
+
46
+ learning_rate_te2 = 2.5e-6
47
+
48
+ Train Clip: true, Train t5xxl: false