Diffusers
lora
TheRemixer commited on
Commit
fee8b04
·
verified ·
1 Parent(s): e0570e1

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +28 -0
README.md ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model:
4
+ - CabalResearch/NoobAI-Flux2VAE-RectifiedFlow-0.3
5
+ library_name: diffusers
6
+ tags:
7
+ - lora
8
+ ---
9
+ # Files:
10
+ ## **Loras/Mugen_NoobFlux2RF_Test-000003.safetensors**
11
+
12
+ Produces different results, *normally* worse, trained for [Mugen](https://huggingface.co/CabalResearch/Mugen) with an auxillary training objective from https://arxiv.org/abs/2411.04873.
13
+
14
+ I decode both the ground truth latent and predicited clean latent, and take the first two up_blocks of the VAE when decoding both the ground truth latent and predicited latent, then compare them against each other using L2 loss and add that back onto the regular flow matching loss at a weight of 0.1
15
+
16
+ (So `loss_total = flow matching + 0.1 * latent_perceptual_loss`). I only did this if the timestep was less than 50% (`sigmas < 0.5`).
17
+
18
+ ## **Loras/RF-Flux2VAE-Consistency-Test-50-000002.safetensors**
19
+
20
+ trained by generating one forward pass from 20-50 timestep before the target timestep with **no gradients**. Then simulating one euler step to the target timestep and using the resulting latent as the input to the model and training on that. So the model is trained on `clean latent + noise + discretization error + mispredicition error from previous step`
21
+
22
+ My thinking was at inference time the model **doesn't** only receive the `clean latent + gaussian noise` like in training, but also `+ discretization error + mispredicition error from previous step`
23
+
24
+ Works on [Mugen](https://huggingface.co/CabalResearch/Mugen) too, but trained on [NoobAI-Flux2VAE-RectifiedFlow-0.3](https://huggingface.co/CabalResearch/NoobAI-Flux2VAE-RectifiedFlow-0.3)
25
+
26
+ ## **Loras/RF-Flux2VAE-Consistency-Test-000002.safetensors**
27
+
28
+ Same as above, with bigger but worse dataset. Use the above one (the one with 50 in the name) generally.