Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,82 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
language:
|
| 4 |
+
- en
|
| 5 |
+
metrics:
|
| 6 |
+
- accuracy
|
| 7 |
+
base_model:
|
| 8 |
+
- ByteDance-Seed/BAGEL-7B-MoT
|
| 9 |
+
pipeline_tag: text-to-image
|
| 10 |
+
---
|
| 11 |
+
|
| 12 |
+
---
|
| 13 |
+
license: apache-2.0
|
| 14 |
+
language:
|
| 15 |
+
- en
|
| 16 |
+
- zh
|
| 17 |
+
tags:
|
| 18 |
+
- text-to-image
|
| 19 |
+
- fake-image-detection
|
| 20 |
+
- unigendet
|
| 21 |
+
- bagel
|
| 22 |
+
---
|
| 23 |
+
|
| 24 |
+
<h1 align="center">[CVPR 2026] UniGenDet: A Unified Generative-Discriminative Framework</h1>
|
| 25 |
+
|
| 26 |
+
<p align="center">
|
| 27 |
+
<b>
|
| 28 |
+
<a href="https://github.com/Zhangyr2022/">Yanran Zhang</a>,
|
| 29 |
+
<a href="https://wzzheng.net/#">Wenzhao Zheng</a><sup>†</sup>,
|
| 30 |
+
<a href="https://joeleelyf.github.io/">Yifei Li</a>,
|
| 31 |
+
<a href="https://yuby14.github.io/">Bingyao Yu</a>,
|
| 32 |
+
<a href="https://yzheng97.github.io/">Yu Zheng</a>,
|
| 33 |
+
<a href="https://leichenthu.github.io/">Lei Chen</a>,
|
| 34 |
+
<a href="https://scholar.google.com/citations?user=6a79aPwAAAAJ&hl=en">Jie Zhou</a><sup>*</sup>,
|
| 35 |
+
<a href="https://ivg.au.tsinghua.edu.cn/Jiwen_Lu/">Jiwen Lu</a>
|
| 36 |
+
</b>
|
| 37 |
+
<br/>
|
| 38 |
+
Department of Automation, Tsinghua University, China
|
| 39 |
+
<br/>
|
| 40 |
+
<sup>*</sup>Corresponding author <sup>†</sup>Project leader
|
| 41 |
+
</p>
|
| 42 |
+
|
| 43 |
+
<p align="center">
|
| 44 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/661cfae9a853782abad2a495/lBHJD1nNztgmdwc_WqVli.png" width="100%" alt="UniGenDet Teaser"/>
|
| 45 |
+
</p>
|
| 46 |
+
|
| 47 |
+
**UniGenDet** is a unified co-evolutionary framework that jointly optimizes image generation and generated-image detection in a single loop. By bridging generation and authenticity understanding through symbiotic multimodal self-attention, UniGenDet turns the traditional "generator vs. detector" arms race into a closed-loop collaboration.
|
| 48 |
+
|
| 49 |
+
This repository hosts the fine-tuned model weights for UniGenDet.
|
| 50 |
+
|
| 51 |
+
### 🔗 Links
|
| 52 |
+
- **GitHub Repository (Code & Detailed Instructions):** [Zhangyr2022/UniGenDet](https://github.com/Zhangyr2022/UniGenDet)
|
| 53 |
+
- **Paper (arXiv):** [2604.21904](https://arxiv.org/abs/2604.21904v1)
|
| 54 |
+
- **Project Website:** [UniGenDet Project Page](https://ivg-yanranzhang.github.io/UniGenDet/)
|
| 55 |
+
|
| 56 |
+
### 🚀 Getting Started
|
| 57 |
+
|
| 58 |
+
The UniGenDet model supports two main tasks:
|
| 59 |
+
1. **Text-to-Image Generation (`t2i`)**
|
| 60 |
+
2. **AI-Generated Image Detection and Explanation (`detection`)**
|
| 61 |
+
|
| 62 |
+
To use these weights for generation, detection, or further fine-tuning, please refer to the official [GitHub repository](https://github.com/Zhangyr2022/UniGenDet). The repository provides a comprehensive `demo.py` script for interactive inference.
|
| 63 |
+
|
| 64 |
+
**Quick Inference Example Setup:**
|
| 65 |
+
1. Clone the GitHub repository: `git clone https://github.com/Zhangyr2022/UniGenDet.git`
|
| 66 |
+
2. Install dependencies as outlined in the repo's `README.md`.
|
| 67 |
+
3. Download the base BAGEL pretrained assets.
|
| 68 |
+
4. Run `demo.py` pointing to this Hugging Face model directory.
|
| 69 |
+
|
| 70 |
+
For complete installation, data preparation, training (GDUF/DIGA), and evaluation instructions, please consult the [main GitHub repository](https://github.com/Zhangyr2022/UniGenDet).
|
| 71 |
+
|
| 72 |
+
### Citation
|
| 73 |
+
|
| 74 |
+
```bibtex
|
| 75 |
+
@article{zhang2026unigendet,
|
| 76 |
+
title = {UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection},
|
| 77 |
+
author = {Zhang, Yanran and Zheng, Wenzhao and Li, Yifei and Yu, Bingyao and Zheng, Yu and Chen, Lei and Zhou, Jie and Lu, Jiwen},
|
| 78 |
+
journal = {CoRR},
|
| 79 |
+
volume = {abs/2604.21904},
|
| 80 |
+
year = {2026},
|
| 81 |
+
url = {[https://arxiv.org/abs/2604.21904](https://arxiv.org/abs/2604.21904)},
|
| 82 |
+
}
|