	---
	language:
	- en
	license: apache-2.0
	tags:
	- image-to-video
	- video-generation
	- identity-preservation
	- view-consistency
	- diffusion
	- consid-gen
	pipeline_tag: image-to-video
	library_name: diffsynth
	---

	# ConsID-Gen

	ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation

	Mingyang Wu, Ashirbad Mishra, Soumik Dey, Shuo Xing, Naveen Ravipati, Hansi Wu, Binbin Li, Zhengzhong Tu (2026)

	Accepted to CVPR 2026.

	## Summary

	This repository hosts the model checkpoint for the paper *ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation*.

	ConsID-Gen focuses on generating videos that maintain:

	- strong identity preservation,
	- cross-view consistency,
	- temporal coherence.

	## Files

	- `model.safetensors`: Main model checkpoint.
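
To check what the checkpoint contains before loading it, the header of a `.safetensors` file can be read on its own. The sketch below uses only the Python standard library and relies on the published safetensors layout (an 8-byte little-endian header length followed by a JSON header). The function names `read_safetensors_header` and `summarize_tensors` are illustrative and not part of the ConsID-Gen code; the tensor names inside `model.safetensors` are not documented here, so nothing is assumed about them.

```python
import json
import struct
from pathlib import Path


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file without loading tensor data."""
    with Path(path).open("rb") as f:
        # First 8 bytes: little-endian unsigned 64-bit length of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        # The next header_len bytes are UTF-8 JSON describing every tensor.
        header = json.loads(f.read(header_len).decode("utf-8"))
    return header


def summarize_tensors(header):
    """Map each tensor name to (dtype, shape), skipping the optional metadata entry."""
    return {
        name: (info["dtype"], tuple(info["shape"]))
        for name, info in header.items()
        if name != "__metadata__"
    }
```

Calling `summarize_tensors(read_safetensors_header("model.safetensors"))` on a downloaded copy gives a quick view of parameter names, dtypes, and shapes without reading the weights into memory.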

	## Usage

	Training and inference are run through the project scripts, for example `run_train_considgen.py` and `run_inference_considgen.py`. Adapt the paths and configs to your environment before running them.
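
As a starting point for adapting paths, the sketch below returns a local path to the checkpoint and downloads it only if it is missing. It assumes the repository id is `mingyang-wu/ConsID-Gen` (taken from where this card is hosted) and that `huggingface_hub` is installed; `resolve_checkpoint` and the `checkpoints` directory are hypothetical names chosen for illustration. How the project scripts accept a checkpoint path (flag or config file) is not specified here.

```python
from pathlib import Path

REPO_ID = "mingyang-wu/ConsID-Gen"  # assumed repository id for this model card
CHECKPOINT_NAME = "model.safetensors"  # file listed under "Files"


def resolve_checkpoint(local_dir="checkpoints"):
    """Return a local path to the checkpoint, downloading it only if it is missing."""
    target = Path(local_dir) / CHECKPOINT_NAME
    if target.is_file():
        return target
    # Imported lazily so the function works offline when the file already exists.
    from huggingface_hub import hf_hub_download

    return Path(
        hf_hub_download(
            repo_id=REPO_ID,
            filename=CHECKPOINT_NAME,
            local_dir=local_dir,
        )
    )
```

The returned path can then be supplied wherever your copy of the training or inference script expects the checkpoint location.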

	## Citation

	```bibtex
	@misc{wu2026considgenviewconsistentidentitypreservingimagetovideo,
	  title={ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation},
	  author={Mingyang Wu and Ashirbad Mishra and Soumik Dey and Shuo Xing and Naveen Ravipati and Hansi Wu and Binbin Li and Zhengzhong Tu},
	  year={2026},
	  eprint={2602.10113},
	  archivePrefix={arXiv},
	  primaryClass={cs.CV},
	  url={https://arxiv.org/abs/2602.10113},
	}
	```