AIBRUH
/

biteve

Model card Files Files and versions

xet

Community

AIBRUH commited on 16 days ago

Commit

470f6d9

verified ·

1 Parent(s): f794a80

Upload CREDITS.md with huggingface_hub

Browse files

Files changed (1) hide show

CREDITS.md +64 -0

CREDITS.md ADDED Viewed

	@@ -0,0 +1,64 @@

+# EDEN OS V2 — Credits & Acknowledgements
+## Hallo — Portrait Image Animation
+EDEN OS V2's face animation system is powered by **Hallo**, developed by the
+**Fudan University Generative Vision Lab** (fudan-generative-vision).
+We extend our deepest gratitude and thanks to the Hallo team for their
+groundbreaking research in audio-driven portrait animation.
+### Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization
+- **Paper**: SIGGRAPH Asia 2025
+- **Authors**: Jiahao Cui, Baoyou Chen, Mingwang Xu, Hanlin Shang, Yuxuan Chen,
+  Yun Zhan, Zilong Dong, Yao Yao, Jingdong Wang, Siyu Zhu
+- **Institutions**: Fudan University, Baidu Inc, Nanjing University, Alibaba Group
+- **Repository**: https://github.com/fudan-generative-vision/hallo4
+- **Model**: https://huggingface.co/fudan-generative-ai/hallo4
+- **arXiv**: https://arxiv.org/abs/2505.23525
+### Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks
+- **Authors**: Jiahao Cui, Hui Li, Yun Zhan, et al.
+- **Repository**: https://github.com/fudan-generative-vision/hallo3
+- **Model**: https://huggingface.co/fudan-generative-ai/hallo3
+- **arXiv**: https://arxiv.org/abs/2412.00733
+### Citation
+```bibtex
+@misc{cui2025hallo4,
+    title={Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization},
+    author={Jiahao Cui and Baoyou Chen and Mingwang Xu and Hanlin Shang and
+            Yuxuan Chen and Yun Zhan and Zilong Dong and Yao Yao and
+            Jingdong Wang and Siyu Zhu},
+    year={2025},
+    eprint={2505.23525},
+    archivePrefix={arXiv},
+    primaryClass={cs.CV}
+}
+@misc{cui2024hallo3,
+    title={Hallo3: Highly Dynamic and Realistic Portrait Image Animation
+           with Diffusion Transformer Networks},
+    author={Jiahao Cui and Hui Li and Yun Zhang and Hanlin Shang and
+            Kaihui Cheng and Yuqi Ma and Shan Mu and Hang Zhou and
+            Jingdong Wang and Siyu Zhu},
+    year={2024},
+    eprint={2412.00733},
+    archivePrefix={arXiv},
+    primaryClass={cs.CV}
+}
+```
+## Additional Technologies
+- **Edge TTS** — Microsoft Edge Text-to-Speech for Eve's voice
+- **AvatarForcing** — One-step streaming talking avatars (arXiv:2603.14331)
+- **Wav2Vec2** — Facebook's audio encoder (facebook/wav2vec2-base-960h)
+- **WAN2.1** — Base video generation model (Wan-AI/Wan2.1-T2V-1.3B)
+- **MediaPipe** — Google's face mesh detection
+## License
+Hallo4 is a derivative of WAN2.1-1.3B, governed by the WAN LICENSE.
+Hallo3 is a derivative of CogVideo-5B, released under MIT license.