--- language: - en license: apache-2.0 tags: - image-to-video - video-generation - identity-preservation - view-consistency - diffusion - consid-gen pipeline_tag: image-to-video library_name: diffsynth --- # ConsID-Gen **ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation** Mingyang Wu, Ashirbad Mishra, Soumik Dey, Shuo Xing, Naveen Ravipati, Hansi Wu, Binbin Li, Zhengzhong Tu (2026) Accepted by **CVPR 2026**. ## Summary This repository contains the model checkpoint for our paper: **ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation**. ConsID-Gen focuses on generating videos that maintain: - strong identity preservation, - cross-view consistency, - temporal coherence. ## Files - `model.safetensors`: Main model checkpoint. ## Usage Please refer to the project scripts for training/inference entry points (for example `run_train_considgen.py` and `run_inference_considgen.py`) and adapt paths/configs to your environment. ## Citation ```bibtex @misc{wu2026considgenviewconsistentidentitypreservingimagetovideo, title={ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation}, author={Mingyang Wu and Ashirbad Mishra and Soumik Dey and Shuo Xing and Naveen Ravipati and Hansi Wu and Binbin Li and Zhengzhong Tu}, year={2026}, eprint={2602.10113}, archivePrefix={arXiv}, primaryClass={cs.CV}, url={https://arxiv.org/abs/2602.10113}, } ```