Upload CREDITS.md with huggingface_hub
Browse files- CREDITS.md +64 -0
CREDITS.md
ADDED
|
@@ -0,0 +1,64 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# EDEN OS V2 — Credits & Acknowledgements
|
| 2 |
+
|
| 3 |
+
## Hallo — Portrait Image Animation
|
| 4 |
+
|
| 5 |
+
EDEN OS V2's face animation system is powered by **Hallo**, developed by the
|
| 6 |
+
**Fudan University Generative Vision Lab** (fudan-generative-vision).
|
| 7 |
+
|
| 8 |
+
We extend our deepest gratitude and thanks to the Hallo team for their
|
| 9 |
+
groundbreaking research in audio-driven portrait animation.
|
| 10 |
+
|
| 11 |
+
### Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization
|
| 12 |
+
- **Paper**: SIGGRAPH Asia 2025
|
| 13 |
+
- **Authors**: Jiahao Cui, Baoyou Chen, Mingwang Xu, Hanlin Shang, Yuxuan Chen,
|
| 14 |
+
Yun Zhan, Zilong Dong, Yao Yao, Jingdong Wang, Siyu Zhu
|
| 15 |
+
- **Institutions**: Fudan University, Baidu Inc, Nanjing University, Alibaba Group
|
| 16 |
+
- **Repository**: https://github.com/fudan-generative-vision/hallo4
|
| 17 |
+
- **Model**: https://huggingface.co/fudan-generative-ai/hallo4
|
| 18 |
+
- **arXiv**: https://arxiv.org/abs/2505.23525
|
| 19 |
+
|
| 20 |
+
### Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks
|
| 21 |
+
- **Authors**: Jiahao Cui, Hui Li, Yun Zhan, et al.
|
| 22 |
+
- **Repository**: https://github.com/fudan-generative-vision/hallo3
|
| 23 |
+
- **Model**: https://huggingface.co/fudan-generative-ai/hallo3
|
| 24 |
+
- **arXiv**: https://arxiv.org/abs/2412.00733
|
| 25 |
+
|
| 26 |
+
### Citation
|
| 27 |
+
|
| 28 |
+
```bibtex
|
| 29 |
+
@misc{cui2025hallo4,
|
| 30 |
+
title={Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization},
|
| 31 |
+
author={Jiahao Cui and Baoyou Chen and Mingwang Xu and Hanlin Shang and
|
| 32 |
+
Yuxuan Chen and Yun Zhan and Zilong Dong and Yao Yao and
|
| 33 |
+
Jingdong Wang and Siyu Zhu},
|
| 34 |
+
year={2025},
|
| 35 |
+
eprint={2505.23525},
|
| 36 |
+
archivePrefix={arXiv},
|
| 37 |
+
primaryClass={cs.CV}
|
| 38 |
+
}
|
| 39 |
+
|
| 40 |
+
@misc{cui2024hallo3,
|
| 41 |
+
title={Hallo3: Highly Dynamic and Realistic Portrait Image Animation
|
| 42 |
+
with Diffusion Transformer Networks},
|
| 43 |
+
author={Jiahao Cui and Hui Li and Yun Zhang and Hanlin Shang and
|
| 44 |
+
Kaihui Cheng and Yuqi Ma and Shan Mu and Hang Zhou and
|
| 45 |
+
Jingdong Wang and Siyu Zhu},
|
| 46 |
+
year={2024},
|
| 47 |
+
eprint={2412.00733},
|
| 48 |
+
archivePrefix={arXiv},
|
| 49 |
+
primaryClass={cs.CV}
|
| 50 |
+
}
|
| 51 |
+
```
|
| 52 |
+
|
| 53 |
+
## Additional Technologies
|
| 54 |
+
|
| 55 |
+
- **Edge TTS** — Microsoft Edge Text-to-Speech for Eve's voice
|
| 56 |
+
- **AvatarForcing** — One-step streaming talking avatars (arXiv:2603.14331)
|
| 57 |
+
- **Wav2Vec2** — Facebook's audio encoder (facebook/wav2vec2-base-960h)
|
| 58 |
+
- **WAN2.1** — Base video generation model (Wan-AI/Wan2.1-T2V-1.3B)
|
| 59 |
+
- **MediaPipe** — Google's face mesh detection
|
| 60 |
+
|
| 61 |
+
## License
|
| 62 |
+
|
| 63 |
+
Hallo4 is a derivative of WAN2.1-1.3B, governed by the WAN LICENSE.
|
| 64 |
+
Hallo3 is a derivative of CogVideo-5B, released under MIT license.
|