|
|
|
|
|
|
|
|
--- |
|
|
license: mit |
|
|
base_model: runwayml/stable-diffusion-v1-5 |
|
|
tags: |
|
|
- stable-diffusion |
|
|
- diffusion |
|
|
- distillation |
|
|
- flow-matching |
|
|
- geometric-deep-learning |
|
|
- research |
|
|
library_name: diffusers |
|
|
pipeline_tag: text-to-image |
|
|
--- |
|
|
|
|
|
# Why do I hear boss music? |
|
|
|
|
|
## 10000 steps |
|
|
|
|
|
Currently retraining the scale, but it was trained with many raw unscaled latents and it makes the default output hazy. |
|
|
 |
|
|
Use this to correctly orient the output to the correct VAE scale. |
|
|
|
|
|
## Shift 2 is the training target |
|
|
 |
|
|
Higher or lower may yield different results. |
|
|
|
|
|
## use this |
|
|
 |
|
|
|
|
|
|
|
|
|
|
|
a castle at sunset |
|
|
 |
|
|
|
|
|
a mountain view with a beautiful landscape |
|
|
 |
|
|
|
|
|
a woman sitting on the bus |
|
|
 |
|
|
|
|
|
a carrot on a cake |
|
|
 |
|
|
|
|
|
a refrigerator to the left of a table |
|
|
 |
|
|
|
|
|
a mad scientist's laboratory with strange gagets and mechanisms |
|
|
 |
|
|
|
|
|
steampunk goku |
|
|
 |
|
|
|
|
|
|
|
|
a man standing on top of a table in the middle of a room full of curtains. |
|
|
 |
|
|
|
|
|
## 5000 steps |
|
|
|
|
|
|
|
|
 |
|
|
|
|
|
|
|
|
 |
|
|
|
|
|
|
|
|
 |
|
|
|
|
|
 |
|
|
|
|
|
a mad scientists laboratory |
|
|
 |
|
|
|
|
|
## 4000 steps |
|
|
Utilizing this synthesized image set here: |
|
|
https://huggingface.co/datasets/AbstractPhil/sd15-latent-distillation-500k |
|
|
|
|
|
As of typing this, the 500k isn't finished synthesizing. It's at around 200k, which should be more than enough to get a baseline. |
|
|
|
|
|
|
|
|
At 4000 steps the new flow matching trainer is already manifesting results. |
|
|
 |
|
|
|
|
|
 |
|
|
|
|
|
 |
|
|
|
|
|
 |
|
|
|
|
|
|
|
|
Within 4000 steps at batch 16 the pretrained flow matching SD1.5 model is already building convergence. |
|
|
This model was the sd15-flow-matching-try2 aka Lune variation, and I can say for certain she is most definitely not burned. |
|
|
|
|
|
The trainer is in the files. |
|
|
|
|
|
|
|
|
|