nielsr HF Staff commited on
Commit
c86c072
Β·
verified Β·
1 Parent(s): 53d9042

Add pipeline tag and improve model card

Browse files

Hi! I'm Niels from the Hugging Face team. I'm opening this PR to improve the model card for DreamID-V:
- Added the `image-to-video` pipeline tag to the metadata for better discoverability.
- Fixed the broken Arxiv link in the README.
- Added a link to the official GitHub repository.
- Standardized the citation to BibTeX format.

Files changed (1) hide show
  1. README.md +15 -12
README.md CHANGED
@@ -1,21 +1,24 @@
1
  ---
2
  license: apache-2.0
 
3
  ---
 
4
  # DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
5
 
6
  <p align="center">
7
  <a href="https://guoxu1233.github.io/DreamID-V/">🌐 Project Page</a> |
8
- <a href="https://arxiv.org5">πŸ“œ Arxiv</a> |
 
9
  <a href="https://huggingface.co/XuGuo699/DreamID-V">πŸ€— Models</a> |
10
  </p>
11
 
12
  > **DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer**<br>
13
- > [Xu Guo](https://github.com/Guoxu1233/)<sup> * </sup>, [Fulong Ye](https://scholar.google.com/citations?user=-BbQ5VgAAAAJ&hl=zh-CN/)<sup> * </sup>, [Xinghui Li](https://crayon-shinchan.github.io/xinghui99.github.io/)<sup> *</sup>, [Pengqi Tu](https://openreview.net/profile?id=%7EPengqi_Tu1), [Pengze Zhang](https://pangzecheung.github.io/Homepage/), [Qichao Sun](https://github.com/sun631998316), [Songtao Zhao](https://openreview.net/profile?id=~Songtao_Zhao1)<sup> &dagger;</sup>, [Xiangwang Hou](https://scholar.google.com/citations?user=bpskf9kAAAAJ&hl=zh-CN)<sup> &dagger;</sup> [Qian He](https://scholar.google.com/citations?user=9rWWCgUAAAAJ)
14
- > <br><sup> * </sup>Equal contribution,<sup> &dagger; </sup>Corresponding author
15
  > <br>Tsinghua University | Intelligent Creation Team, ByteDance<br>
16
 
17
  <p align="center">
18
- <img src="teaser.png" width=95%>
19
  <p>
20
 
21
  ## ⚑️ Quickstart
@@ -43,8 +46,8 @@ pip install -r requirements.txt
43
  ``` sh
44
  python generate_dreamidv.py \
45
  --size 832*480 \
46
- --ckpt_dir wan2.1-1.3B path \
47
- --dreamidv_ckpt dreamidv.pth path \
48
  --sample_steps 50 \
49
  --base_seed 42
50
  ```
@@ -55,8 +58,8 @@ python generate_dreamidv.py \
55
  pip install "xfuser>=0.4.1"
56
  torchrun --nproc_per_node=2 generate_dreamidv.py \
57
  --size 832*480 \
58
- --ckpt_dir wan2.1-1.3B path \
59
- --dreamidv_ckpt dreamidv.pth path \
60
  --sample_steps 50 \
61
  --dit_fsdp \
62
  --t5_fsdp \
@@ -70,14 +73,14 @@ Our work builds upon and is greatly inspired by several outstanding open-source
70
 
71
 
72
  ## πŸ“§ Contact
73
- If you have any comments or questions regarding this open-source project, please open a new issue or contact [Xu Guo](https://github.com/Guoxu1233/).
74
 
75
 
76
  ## ⭐ Citation
77
 
78
- If you find our work helpful, please consider citing our paper and leaving valuable stars
79
 
80
- ```
81
  @misc{guo2026dreamidvbridgingimagetovideogaphighfidelity,
82
  title={DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer},
83
  author={Xu Guo and Fulong Ye and Xinghui Li and Pengqi Tu and Pengze Zhang and Qichao Sun and Songtao Zhao and Xiangwang Hou and Qian He},
@@ -87,4 +90,4 @@ If you find our work helpful, please consider citing our paper and leaving valua
87
  primaryClass={cs.CV},
88
  url={https://arxiv.org/abs/2601.01425},
89
  }
90
- ```
 
1
  ---
2
  license: apache-2.0
3
+ pipeline_tag: image-to-video
4
  ---
5
+
6
  # DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
7
 
8
  <p align="center">
9
  <a href="https://guoxu1233.github.io/DreamID-V/">🌐 Project Page</a> |
10
+ <a href="https://arxiv.org/abs/2601.01425">πŸ“œ Arxiv</a> |
11
+ <a href="https://github.com/bytedance/DreamID-V">πŸ’» GitHub</a> |
12
  <a href="https://huggingface.co/XuGuo699/DreamID-V">πŸ€— Models</a> |
13
  </p>
14
 
15
  > **DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer**<br>
16
+ > [Xu Guo](https://github.com/Guoxu1233/)<sup> * </sup>, [Fulong Ye](https://scholar.google.com/citations?user=-BbQ5VgAAAAJ&hl=zh-CN/)<sup> * </sup>, [Xinghui Li](https://crayon-shinchan.github.io/xinghui99.github.io/)<sup> *</sup>, [Pengqi Tu](https://openreview.net/profile?id=%7EPengqi_Tu1), [Pengze Zhang](https://pangzecheung.github.io/Homepage/), [Qichao Sun](https://github.com/sun631998316), [Songtao Zhao](https://openreview.net/profile?id=~Songtao_Zhao1)<sup> †</sup>, [Xiangwang Hou](https://scholar.google.com/citations?user=bpskf9kAAAAJ&hl=zh-CN)<sup> †</sup> [Qian He](https://scholar.google.com/citations?user=9rWWCgUAAAAJ)
17
+ > <br><sup> * </sup>Equal contribution,<sup> † </sup>Corresponding author
18
  > <br>Tsinghua University | Intelligent Creation Team, ByteDance<br>
19
 
20
  <p align="center">
21
+ <img src="https://huggingface.co/XuGuo699/DreamID-V/resolve/main/teaser.png" width=95%>
22
  <p>
23
 
24
  ## ⚑️ Quickstart
 
46
  ``` sh
47
  python generate_dreamidv.py \
48
  --size 832*480 \
49
+ --ckpt_dir wan2.1-1.3B_path \
50
+ --dreamidv_ckpt dreamidv.pth_path \
51
  --sample_steps 50 \
52
  --base_seed 42
53
  ```
 
58
  pip install "xfuser>=0.4.1"
59
  torchrun --nproc_per_node=2 generate_dreamidv.py \
60
  --size 832*480 \
61
+ --ckpt_dir wan2.1-1.3B_path \
62
+ --dreamidv_ckpt dreamidv.pth_path \
63
  --sample_steps 50 \
64
  --dit_fsdp \
65
  --t5_fsdp \
 
73
 
74
 
75
  ## πŸ“§ Contact
76
+ If you have any comments or questions regarding this open-source project, please open a new issue or contact [Xu Guo](https://github.com/Guoxu1233/) and [Fulong Ye](https://github.com/superhero-7).
77
 
78
 
79
  ## ⭐ Citation
80
 
81
+ If you find our work helpful, please consider citing our paper:
82
 
83
+ ```bibtex
84
  @misc{guo2026dreamidvbridgingimagetovideogaphighfidelity,
85
  title={DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer},
86
  author={Xu Guo and Fulong Ye and Xinghui Li and Pengqi Tu and Pengze Zhang and Qichao Sun and Songtao Zhao and Xiangwang Hou and Qian He},
 
90
  primaryClass={cs.CV},
91
  url={https://arxiv.org/abs/2601.01425},
92
  }
93
+ ```