Update README.md
Browse files
README.md
CHANGED
|
@@ -1,6 +1,7 @@
|
|
| 1 |
---
|
| 2 |
language:
|
| 3 |
- en
|
|
|
|
| 4 |
base_model:
|
| 5 |
- Wan-AI/Wan2.1-I2V-14B-480P-Diffusers
|
| 6 |
pipeline_tag: video-to-video
|
|
@@ -10,4 +11,44 @@ tags:
|
|
| 10 |
- exocentric
|
| 11 |
- exo-to-ego
|
| 12 |
- ic-lora
|
| 13 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
language:
|
| 3 |
- en
|
| 4 |
+
license: mit
|
| 5 |
base_model:
|
| 6 |
- Wan-AI/Wan2.1-I2V-14B-480P-Diffusers
|
| 7 |
pipeline_tag: video-to-video
|
|
|
|
| 11 |
- exocentric
|
| 12 |
- exo-to-ego
|
| 13 |
- ic-lora
|
| 14 |
+
---
|
| 15 |
+
|
| 16 |
+
# EgoX: Egocentric Video Generation from a Single Exocentric Video
|
| 17 |
+
|
| 18 |
+
This repository provides **model weights** of **EgoX**, a video-to-video generation model that synthesizes **egocentric (first-person) videos** from a **single exocentric (third-person) video**.
|
| 19 |
+
EgoX is built on top of a large-scale video diffusion backbone and enables exo-to-ego viewpoint transformation without requiring multi-view inputs.
|
| 20 |
+
|
| 21 |
+
For detailed results, implementation details, and demo videos, please refer to our **[paper](https://arxiv.org/abs/2512.08269)** and **[project repository](https://keh0t0.github.io/EgoX/)**.
|
| 22 |
+
|
| 23 |
+
|
| 24 |
+
---
|
| 25 |
+
|
| 26 |
+
## Usage
|
| 27 |
+
|
| 28 |
+
Please refer to the **[Quick Start](https://github.com/DAVIAN-Robotics/EgoX?tab=readme-ov-file#-inference)** section for instructions on running inference and required preprocessing steps.
|
| 29 |
+
|
| 30 |
+
---
|
| 31 |
+
|
| 32 |
+
## Citation
|
| 33 |
+
|
| 34 |
+
If you find this model or code useful in your research, please cite our paper:
|
| 35 |
+
|
| 36 |
+
```bibtex
|
| 37 |
+
@misc{kang2025egoxegocentricvideogeneration,
|
| 38 |
+
title={EgoX: Egocentric Video Generation from a Single Exocentric Video},
|
| 39 |
+
author={Taewoong Kang and Kinam Kim and Dohyeon Kim and Minho Park and Junha Hyung and Jaegul Choo},
|
| 40 |
+
year={2025},
|
| 41 |
+
eprint={2512.08269},
|
| 42 |
+
archivePrefix={arXiv},
|
| 43 |
+
primaryClass={cs.CV},
|
| 44 |
+
url={https://arxiv.org/abs/2512.08269},
|
| 45 |
+
}
|
| 46 |
+
```
|
| 47 |
+
|
| 48 |
+
## Acknowledgement
|
| 49 |
+
|
| 50 |
+
This work builds upon the valuable open-source efforts of
|
| 51 |
+
[4DNeX](https://github.com/3DTopia/4DNeX) and
|
| 52 |
+
[EgoExo4D](https://github.com/facebookresearch/Ego-Exo).
|
| 53 |
+
|
| 54 |
+
We sincerely appreciate their contributions to the computer vision and robotics communities.
|