fix
README.md
CHANGED
@@ -153,18 +153,18 @@ VAE: Changed, new VAE - [EQB7](https://huggingface.co/Anzhc/MS-LC-EQ-D-VR_VAE) w
153   (Base / quality-tuned)
154
155   **Samples seen**(unbatched steps): ~2kk / ~400k
156 - **Learning Rate**: 2e-5 / 2e-5
157 - **Effective Batch size**: 1280 (40 real * 4 accum * 8 devices) / 1280 (40 * 4 * 8)
158 - **Precision**: Full BF16
159 - **Optimizer**: AdamW8bit with Kahan Summation
160   **Weight Decay**: 0.01
161 - **Schedule**: Constant with warmup
162   **Timestep Sampling Strategy**: Logit-Normal (sometimes referred to as Lognorm), Shift 2.5
163 - **Text Encoders**: Frozen
164   **Keep Token**: False (Used "Protected Tags" instead), all tags are shuffled.
165 - **Tag Dropout**: 10%
166   **Uncond Dropout**: 10%
167 - **Optimal Transport**: True
168
169
170
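The batch-size line in this hunk is plain arithmetic, so it can be checked directly. The sketch below recomputes the effective batch size and, as a derived figure, the number of optimizer steps implied by the "samples seen" counts. The function names are hypothetical, and reading "~2kk" as roughly 2,000,000 samples is an assumption.

```python
# Hypothetical helper names; the numbers come from the settings in this hunk.
def effective_batch_size(per_device: int, grad_accum: int, devices: int) -> int:
    """Samples consumed per optimizer step."""
    return per_device * grad_accum * devices


def optimizer_steps(samples_seen: int, batch_size: int) -> float:
    """Optimizer steps implied by a count of unbatched samples."""
    return samples_seen / batch_size


batch = effective_batch_size(per_device=40, grad_accum=4, devices=8)

# Assumption: "~2kk" means about 2,000,000 samples and "~400k" about 400,000.
base_steps = optimizer_steps(2_000_000, batch)
tuned_steps = optimizer_steps(400_000, batch)

print(batch)        # 1280
print(base_steps)   # 1562.5
print(tuned_steps)  # 312.5
```

Since the sample counts are approximate, the step counts are too: on the order of 1,500 optimizer steps for the base run and about 300 for the quality tune.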
@@ -193,13 +193,13 @@ My current style training settings (Anzhc):
193   **Schedule**: ReREX (Use REX for simplicity)
194   **Precision**: Full BF16
195   **Weight Decay**: 0.02
196 - **Timestep Sampling Strategy**: Logit-Normal, Shift 2.5 (Closest to what i use result-wise)
197
198 - **Dim/Alpha/Conv/Alpha**: 24/24/24/24 (Lycoris/Locon)
199
200 - **Text Encoders**: Frozen
201
202 - **Optimal Transport**: True
203
204   **Expected Dataset Size**: 100 images (Can be even 10, but balance with repeats to roughly this target.)
205   **Epochs**: 50 (Yes, even with 10 repeats. 500 effective epochs works just fine and doesn't break from my tests.)
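The dataset-size and epoch lines in this hunk describe a small calculation: repeats bring a tiny dataset up to the roughly 100-image target per epoch, and epochs times repeats gives the "effective epochs" figure. A minimal sketch of that arithmetic follows; the function names are hypothetical.

```python
# Hypothetical helper names; the numbers come from the style settings in this hunk.
def images_per_epoch(images: int, repeats: int) -> int:
    """How many samples one epoch contains once repeats are applied."""
    return images * repeats


def passes_per_image(epochs: int, repeats: int) -> int:
    """How many times each individual image is seen over the whole run."""
    return epochs * repeats


# 10 images with 10 repeats reach the ~100 image target per epoch.
print(images_per_epoch(10, 10))  # 100

# 50 epochs at 10 repeats is the "500 effective epochs" case.
print(passes_per_image(50, 10))  # 500

# A full 100-image dataset with no repeats sees each image 50 times.
print(passes_per_image(50, 1))   # 50
```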
153   (Base / quality-tuned)
154
155   **Samples seen**(unbatched steps): ~2kk / ~400k
156 + **Learning Rate**: 2e-5 / 2e-5
157 + **Effective Batch size**: 1280 (40 real * 4 accum * 8 devices) / 1280 (40 * 4 * 8)
158 + **Precision**: Full BF16
159 + **Optimizer**: AdamW8bit with Kahan Summation
160   **Weight Decay**: 0.01
161 + **Schedule**: Constant with warmup
162   **Timestep Sampling Strategy**: Logit-Normal (sometimes referred to as Lognorm), Shift 2.5
163 + **Text Encoders**: Frozen
164   **Keep Token**: False (Used "Protected Tags" instead), all tags are shuffled.
165 + **Tag Dropout**: 10%
166   **Uncond Dropout**: 10%
167 + **Optimal Transport**: True
168
169
170
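The caption settings listed here (shuffled tags, "Protected Tags", 10% tag dropout, 10% uncond dropout) can be illustrated with a short sketch. This is not the trainer's actual code: `build_caption` is a hypothetical function, and treating protected tags as exempt from dropout while still being shuffled is an assumption, since the README does not spell out how protection works.

```python
import random


def build_caption(tags, protected, rng, tag_dropout=0.10, uncond_dropout=0.10):
    """Return one training caption from a list of tags (illustrative only)."""
    # Uncond dropout: with this probability, train on an empty caption.
    if rng.random() < uncond_dropout:
        return ""
    # Assumption: protected tags are never dropped; every other tag is
    # dropped independently with probability `tag_dropout`.
    kept = [t for t in tags if t in protected or rng.random() >= tag_dropout]
    # All surviving tags are shuffled, protected or not (no fixed "keep token").
    rng.shuffle(kept)
    return ", ".join(kept)


rng = random.Random(0)
tags = ["1girl", "solo", "outdoors", "smile", "artist_name"]
for _ in range(3):
    print(build_caption(tags, protected={"artist_name"}, rng=rng))
```

Passing a seeded `random.Random` keeps the sketch reproducible; a real data loader would typically draw from its own per-worker generator.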
193   **Schedule**: ReREX (Use REX for simplicity)
194   **Precision**: Full BF16
195   **Weight Decay**: 0.02
196 + **Timestep Sampling Strategy**: Logit-Normal, Shift 2.5 (Closest to what i use result-wise)
197
198 + **Dim/Alpha/Conv/Alpha**: 24/24/24/24 (Lycoris/Locon)
199
200 + **Text Encoders**: Frozen
201
202 + **Optimal Transport**: True
203
204   **Expected Dataset Size**: 100 images (Can be even 10, but balance with repeats to roughly this target.)
205   **Epochs**: 50 (Yes, even with 10 repeats. 500 effective epochs works just fine and doesn't break from my tests.)
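For readers unfamiliar with "Logit-Normal, Shift 2.5", here is a rough sketch of one common way such a sampler is written. It assumes the shift follows the form t' = s·t / (1 + (s − 1)·t) used by several flow-matching trainers; the README does not confirm that this is the exact rule behind these settings, and `sample_timestep` is a hypothetical name.

```python
import math
import random


def sample_timestep(rng: random.Random, shift: float = 2.5,
                    mean: float = 0.0, std: float = 1.0) -> float:
    """Draw one timestep in (0, 1): logit-normal sample, then shifted."""
    u = rng.gauss(mean, std)        # normal sample in logit space
    t = 1.0 / (1.0 + math.exp(-u))  # sigmoid maps it into (0, 1)
    # Assumed shift rule; shift=1.0 leaves t unchanged.
    return shift * t / (1.0 + (shift - 1.0) * t)


shifted_rng = random.Random(0)
plain_rng = random.Random(0)
shifted = [sample_timestep(shifted_rng, shift=2.5) for _ in range(10_000)]
plain = [sample_timestep(plain_rng, shift=1.0) for _ in range(10_000)]

# With shift > 1 the samples move toward the upper end of the (0, 1) range.
print(sum(plain) / len(plain), sum(shifted) / len(shifted))
```

Which end of the range corresponds to high noise depends on the trainer's convention, so the direction of the shift should be checked against the training code actually in use.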