---
license: openrail++
datasets:
- krahets/dna_rendering_processed
base_model:
- stabilityai/stable-diffusion-2-1-base
pipeline_tag: video-to-video
tags:
- 3d-generation
- 4d-generation
- human
- avatar
- multi-view video
---
# Diffuman4D Model
[**Project Page**](https://diffuman4d.github.io/) | [**Paper**](https://arxiv.org/abs/2507.13344) | [**Code**](https://github.com/zju3dv/Diffuman4D) | [**Dataset**](https://huggingface.co/datasets/krahets/dna_rendering_processed)
> The official model repo for Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models.

<img src="assets/images/teaser_dna.gif" width="100%" alt="teaser">

Diffuman4D enables high-fidelity free-viewpoint rendering of human performances from sparse-view videos.
## Usage
See the [GitHub repo](https://github.com/zju3dv/Diffuman4D) for detailed usage.
## Cite
```bibtex
@inproceedings{jin2025diffuman4d,
  title={Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models},
  author={Jin, Yudong and Peng, Sida and Wang, Xuan and Xie, Tao and Xu, Zhen and Yang, Yifan and Shen, Yujun and Bao, Hujun and Zhou, Xiaowei},
  booktitle={International Conference on Computer Vision (ICCV)},
  year={2025}
}
```