AIBRUH commited on
Commit
470f6d9
·
verified ·
1 Parent(s): f794a80

Upload CREDITS.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. CREDITS.md +64 -0
CREDITS.md ADDED
@@ -0,0 +1,64 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # EDEN OS V2 — Credits & Acknowledgements
2
+
3
+ ## Hallo — Portrait Image Animation
4
+
5
+ EDEN OS V2's face animation system is powered by **Hallo**, developed by the
6
+ **Fudan University Generative Vision Lab** (fudan-generative-vision).
7
+
8
+ We extend our deepest gratitude and thanks to the Hallo team for their
9
+ groundbreaking research in audio-driven portrait animation.
10
+
11
+ ### Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization
12
+ - **Paper**: SIGGRAPH Asia 2025
13
+ - **Authors**: Jiahao Cui, Baoyou Chen, Mingwang Xu, Hanlin Shang, Yuxuan Chen,
14
+ Yun Zhan, Zilong Dong, Yao Yao, Jingdong Wang, Siyu Zhu
15
+ - **Institutions**: Fudan University, Baidu Inc, Nanjing University, Alibaba Group
16
+ - **Repository**: https://github.com/fudan-generative-vision/hallo4
17
+ - **Model**: https://huggingface.co/fudan-generative-ai/hallo4
18
+ - **arXiv**: https://arxiv.org/abs/2505.23525
19
+
20
+ ### Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks
21
+ - **Authors**: Jiahao Cui, Hui Li, Yun Zhan, et al.
22
+ - **Repository**: https://github.com/fudan-generative-vision/hallo3
23
+ - **Model**: https://huggingface.co/fudan-generative-ai/hallo3
24
+ - **arXiv**: https://arxiv.org/abs/2412.00733
25
+
26
+ ### Citation
27
+
28
+ ```bibtex
29
+ @misc{cui2025hallo4,
30
+ title={Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization},
31
+ author={Jiahao Cui and Baoyou Chen and Mingwang Xu and Hanlin Shang and
32
+ Yuxuan Chen and Yun Zhan and Zilong Dong and Yao Yao and
33
+ Jingdong Wang and Siyu Zhu},
34
+ year={2025},
35
+ eprint={2505.23525},
36
+ archivePrefix={arXiv},
37
+ primaryClass={cs.CV}
38
+ }
39
+
40
+ @misc{cui2024hallo3,
41
+ title={Hallo3: Highly Dynamic and Realistic Portrait Image Animation
42
+ with Diffusion Transformer Networks},
43
+ author={Jiahao Cui and Hui Li and Yun Zhang and Hanlin Shang and
44
+ Kaihui Cheng and Yuqi Ma and Shan Mu and Hang Zhou and
45
+ Jingdong Wang and Siyu Zhu},
46
+ year={2024},
47
+ eprint={2412.00733},
48
+ archivePrefix={arXiv},
49
+ primaryClass={cs.CV}
50
+ }
51
+ ```
52
+
53
+ ## Additional Technologies
54
+
55
+ - **Edge TTS** — Microsoft Edge Text-to-Speech for Eve's voice
56
+ - **AvatarForcing** — One-step streaming talking avatars (arXiv:2603.14331)
57
+ - **Wav2Vec2** — Facebook's audio encoder (facebook/wav2vec2-base-960h)
58
+ - **WAN2.1** — Base video generation model (Wan-AI/Wan2.1-T2V-1.3B)
59
+ - **MediaPipe** — Google's face mesh detection
60
+
61
+ ## License
62
+
63
+ Hallo4 is a derivative of WAN2.1-1.3B, governed by the WAN LICENSE.
64
+ Hallo3 is a derivative of CogVideo-5B, released under MIT license.