meteorite2023 commited on
Commit
e590bb4
Β·
verified Β·
1 Parent(s): 5a02a8c

Upload .\assets with huggingface_hub

Browse files
Files changed (1) hide show
  1. .//assets +91 -0
.//assets ADDED
@@ -0,0 +1,91 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ # DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability
3
+
4
+ <div align="center">
5
+
6
+ ### [ICCV 2025]
7
+
8
+ [Xirui Hu](https://openreview.net/profile?id=~Xirui_Hu1),
9
+ [Jiahao Wang](https://openreview.net/profile?id=~Jiahao_Wang14),
10
+ [Hao Chen](https://openreview.net/profile?id=~Hao_chen100),
11
+ [Weizhan Zhang](https://openreview.net/profile?id=~Weizhan_Zhang1),
12
+ [Benqi Wang](https://openreview.net/profile?id=~Benqi_Wang2),
13
+ [Yikun Li](https://openreview.net/profile?id=~Yikun_Li1),
14
+ [Haishun Nan](https://openreview.net/profile?id=~Haishun_Nan1),
15
+
16
+ [![arXiv](https://img.shields.io/badge/arXiv-2503.06505-b31b1b.svg)](https://arxiv.org/abs/2503.06505)
17
+ [![GitHub](https://img.shields.io/badge/GitHub-MTVCrafter-blue?logo=github)](https://github.com/ByteCat-bot/DynamicID)
18
+ </div>
19
+
20
+ ---
21
+ This is the official implementation of DynamicID, a framework that generates visually harmonious image featuring **multiple individuals**. Each person in the image can be specified through user-provided reference images, and most notably, our method enables **independent control of each individual's facial expression** via text prompts. Hope you have fun with this demo!
22
+
23
+ ---
24
+
25
+ ## πŸ” Abstract
26
+
27
+ Recent advancements in text-to-image generation have spurred interest in personalized human image generation. Although existing methods achieve high-fidelity identity preservation, they often struggle with **limited multi-ID usability** and **inadequate facial editability**.
28
+
29
+ We present DynamicID, a tuning-free framework that inherently facilitates both single-ID and multi-ID personalized generation with high fidelity and flexible facial editability. Our key innovations include:
30
+
31
+ - Semantic-Activated Attention (SAA), which employs query-level activation gating to minimize disruption to the original model when injecting ID features and achieve multi-ID personalization without requiring multi-ID samples during training.
32
+
33
+ - Identity-Motion Reconfigurator (IMR), which applies feature-space manipulation to effectively disentangle and reconfigure facial motion and identity features, supporting flexible facial editing.
34
+
35
+ - A task-decoupled training paradigm that reduces data dependency
36
+
37
+ - A curated VariFace-10k facial dataset, comprising 10k unique individuals, each represented by 35 distinct facial images.
38
+
39
+ Experimental results demonstrate that DynamicID outperforms state-of-the-art methods in identity fidelity, facial editability, and multi-ID personalization capability.
40
+
41
+ ## πŸ’‘ Method
42
+
43
+ <div align="center">
44
+ <img src="assets/pipeline.jpg", width="1000">
45
+ </div>
46
+
47
+ The proposed framework is architected around two core components: SAA and IMR. (a) In the anchoring stage, we jointly optimize the SAA and a face encoder to establish robust single-ID and multi-ID personalized generation capabilities. (b) Subsequently in the reconfiguration stage, we freeze these optimized components and leverage them to train the IMR for flexible and fine-grained facial editing.
48
+
49
+ ## πŸš€ Checkpoint
50
+
51
+ 1. Download the pretrained Stable Diffusion v1.5 checkpoint from [Stable Diffusion v1.5 on Hugging Face](https://huggingface.co/stable-diffusion-v1-5/stable-diffusion-v1-5).
52
+
53
+ 2. Download our SAA-related and IMR-related checkpoints from [DynamicID Checkpoints on Hugging Face](https://huggingface.co/meteorite2023/DynamicID).
54
+
55
+
56
+ ## 🌈 Gallery
57
+
58
+ <div align="center">
59
+ <img src="assets/teaser.jpg", width="900">
60
+ <br><br><br>
61
+ <img src="assets/single.jpg", width="900">
62
+ <br><br><br>
63
+ <img src="assets/multi.jpg", width="900">
64
+ </div>
65
+
66
+ ## πŸ“Œ ToDo List
67
+
68
+ - [x] Release technical report
69
+ - [x] Release **training and inference code**
70
+ - [x] Release **Dynamic-sd** (based on *stable diffusion v1.5*)
71
+ - [ ] Release **Dynamic-flux** (based on *Flux-dev*)
72
+ - [ ] Release a Hugging Face Demo Space
73
+
74
+ ## πŸ“– Citation
75
+ If you are inspired by our work, please cite our paper.
76
+ ```bibtex
77
+ @inproceedings{dynamicid,
78
+ title={DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability},
79
+ author={Xirui Hu,
80
+ Jiahao Wang,
81
+ Hao Chen,
82
+ Weizhan Zhang,
83
+ Benqi Wang,
84
+ Yikun Li,
85
+ Haishun Nan
86
+ },
87
+ booktitle={International Conference on Computer Vision},
88
+ year={2025}
89
+ }
90
+
91
+ ```