File size: 1,450 Bytes
9095228 23b63c0 9095228 23b63c0 9095228 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 | ---
language:
- en
license: apache-2.0
tags:
- image-to-video
- video-generation
- identity-preservation
- view-consistency
- diffusion
- consid-gen
pipeline_tag: image-to-video
library_name: diffsynth
---
# ConsID-Gen
**ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation**
Mingyang Wu, Ashirbad Mishra, Soumik Dey, Shuo Xing, Naveen Ravipati, Hansi Wu, Binbin Li, Zhengzhong Tu (2026)
Accepted by **CVPR 2026**.
## Summary
This repository contains the model checkpoint for our paper:
**ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation**.
ConsID-Gen focuses on generating videos that maintain:
- strong identity preservation,
- cross-view consistency,
- temporal coherence.
## Files
- `model.safetensors`: Main model checkpoint.
## Usage
Please refer to the project scripts for training/inference entry points (for example `run_train_considgen.py` and `run_inference_considgen.py`) and adapt paths/configs to your environment.
## Citation
```bibtex
@misc{wu2026considgenviewconsistentidentitypreservingimagetovideo,
title={ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation},
author={Mingyang Wu and Ashirbad Mishra and Soumik Dey and Shuo Xing and Naveen Ravipati and Hansi Wu and Binbin Li and Zhengzhong Tu},
year={2026},
eprint={2602.10113},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2602.10113},
}
```
|