YangZhou24 commited on
Commit
6ce214a
·
verified ·
1 Parent(s): 5cc4eb9

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -0
README.md ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ tags:
5
+ - text-to-image
6
+ - diffusion
7
+ - flux
8
+ - grpo
9
+ - alignment
10
+ pipeline_tag: text-to-image
11
+ base_model: black-forest-labs/FLUX.1-dev
12
+ ---
13
+
14
+ # RealGRPO FLUX DiT Weights
15
+
16
+ This repository provides **DiT weights** fine-tuned from **FLUX.1-dev** with **GRPO** using the **RealGRPO** strategy.
17
+
18
+ RealGRPO targets a common post-training issue in image generation: **reward hacking** (e.g., over-smoothing, over-saturation, and synthetic-looking artifacts).
19
+ Compared with vanilla FLUX and standard GRPO baselines, these weights are optimized to better preserve prompt intent while reducing reward-driven artifacts.
20
+
21
+ ## What Is Included
22
+
23
+ - Fine-tuned FLUX DiT weights (GRPO post-training).
24
+ - Training objective based on contrastive positive/negative style guidance.
25
+ - Compatibility with the RealGRPO codebase inference scripts.
26
+
27
+ ## Method (Brief)
28
+
29
+ RealGRPO uses a LLM to generate prompt-specific style pairs:
30
+ - positive style cues (`pos_style`)
31
+ - negative style cues (`neg_style`)
32
+
33
+ The reward encourages similarity to positive cues while penalizing negative cues, helping the model avoid artifact-prone shortcuts during alignment.
34
+
35
+ > Note: This release contains DiT alignment weights, not a standalone full pipeline package. You need download black-forest-labs/FLUX.1-dev and replace the contents of the `transfermer` directory with the contents of this repository.
36
+
37
+
38
+
39
+