---
language:
- en
license: apache-2.0
tags:
- image-to-video
- video-generation
- identity-preservation
- view-consistency
- diffusion
- consid-gen
pipeline_tag: image-to-video
library_name: diffsynth
---
# ConsID-Gen
**ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation**
Mingyang Wu, Ashirbad Mishra, Soumik Dey, Shuo Xing, Naveen Ravipati, Hansi Wu, Binbin Li, Zhengzhong Tu (2026)
Accepted by **CVPR 2026**.
## Summary
This repository hosts the model checkpoint for the paper
**ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation**.
ConsID-Gen focuses on generating videos that maintain:
- strong identity preservation,
- cross-view consistency,
- temporal coherence.
## Files
- `model.safetensors`: Main model checkpoint.
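
To check what the checkpoint contains before wiring it into a pipeline, the tensor names, dtypes, and shapes can be read straight from the file header without loading any weights. The sketch below assumes only the standard safetensors layout (an 8-byte little-endian header length followed by a JSON header) and uses the Python standard library. The helper names `read_safetensors_header` and `summarize` are illustrative and are not part of the project scripts.

```python
import json
import struct
from pathlib import Path


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file as a dict."""
    with Path(path).open("rb") as f:
        # The first 8 bytes hold the header size as an unsigned
        # little-endian 64-bit integer.
        (header_len,) = struct.unpack("<Q", f.read(8))
        # The next header_len bytes are a UTF-8 JSON document.
        return json.loads(f.read(header_len).decode("utf-8"))


def summarize(header):
    """Map tensor name -> (dtype, shape), skipping the optional metadata entry."""
    return {
        name: (info["dtype"], tuple(info["shape"]))
        for name, info in header.items()
        if name != "__metadata__"
    }


if __name__ == "__main__":
    # Assumes model.safetensors was downloaded into the working directory.
    ckpt = Path("model.safetensors")
    if ckpt.exists():
        summary = summarize(read_safetensors_header(ckpt))
        print(f"{len(summary)} tensors")
        # Print the first few entries only; the full list can be long.
        for name, (dtype, shape) in list(summary.items())[:10]:
            print(name, dtype, shape)
```

Because only the header is parsed, this runs quickly even on large checkpoints and needs no GPU or deep learning framework.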
## Usage
Training and inference entry points are provided in the project scripts (for example, `run_train_considgen.py` and `run_inference_considgen.py`). Adapt the paths and configs to your environment before running them.
## Citation
```bibtex
@misc{wu2026considgenviewconsistentidentitypreservingimagetovideo,
title={ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation},
author={Mingyang Wu and Ashirbad Mishra and Soumik Dey and Shuo Xing and Naveen Ravipati and Hansi Wu and Binbin Li and Zhengzhong Tu},
year={2026},
eprint={2602.10113},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2602.10113},
}
```