File size: 1,603 Bytes
731366c
 
 
 
 
 
 
 
 
fe83bcb
731366c
 
 
 
 
 
 
 
 
 
 
bf5e616
 
 
 
 
 
 
 
 
 
731366c
 
 
 
 
 
 
 
 
cf8bf20
731366c
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
---
library_name: diffusers
license: mit
pipeline_tag: text-to-image
base_model:
- black-forest-labs/FLUX.1-dev
---

# Model Summary
This model is GRPO trained using [UnifiedReward-Flex](https://huggingface.co/collections/CodeGoat24/unifiedreward-flex) as reward on the training dataset of [UniGenBench](https://github.com/CodeGoat24/UniGenBench).

๐Ÿš€ The inference code is available at [Github](https://github.com/CodeGoat24/Pref-GRPO/blob/main/inference/flux_dist_infer.sh).


For further details, please refer to the following resources:
- ๐Ÿ“ฐ Paper: https://arxiv.org/abs/2602.02380
- ๐Ÿช Project Page: https://codegoat24.github.io/UnifiedReward/flex
- ๐Ÿค— Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-flex
- ๐Ÿค— Dataset: https://huggingface.co/datasets/CodeGoat24/UnifiedReward-Flex-SFT-90K
- ๐Ÿ‘‹ Point of Contact: [Yibin Wang](https://codegoat24.github.io)

# Qualitative Results
![image](https://cdn-uploads.huggingface.co/production/uploads/654c6845bac6e6e49895a5b5/6BCPeZmjBpATJfBpfh-WX.png)



![image](https://cdn-uploads.huggingface.co/production/uploads/654c6845bac6e6e49895a5b5/lx0bXWyXT60zUaYz3vTNe.png)


# Quantitative Results
![image](https://cdn-uploads.huggingface.co/production/uploads/654c6845bac6e6e49895a5b5/42ojNtAOR9Krj5RYPSdfB.png)




## Citation

```bibtex
@article{unifiedreward-flex,
  title={Unified Personalized Reward Model for Vision Generation},
  author={Wang, Yibin and Zang, Yuhang and Han, Feng and Bu, Jiazi and Zhou, Yujie and Jin, Cheng and Wang, Jiaqi},
  journal={arXiv preprint arXiv:2602.02380},
  year={2026}
}
```