File size: 1,450 Bytes

---
language:
- en
license: apache-2.0
tags:
- image-to-video
- video-generation
- identity-preservation
- view-consistency
- diffusion
- consid-gen
pipeline_tag: image-to-video
library_name: diffsynth
---

# ConsID-Gen

**ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation**

Mingyang Wu, Ashirbad Mishra, Soumik Dey, Shuo Xing, Naveen Ravipati, Hansi Wu, Binbin Li, Zhengzhong Tu (2026)  
Accepted by **CVPR 2026**.

## Summary

This repository contains the model checkpoint for our paper:

**ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation**.

ConsID-Gen focuses on generating videos that maintain:

- strong identity preservation,
- cross-view consistency,
- temporal coherence.

## Files

- `model.safetensors`: Main model checkpoint.

## Usage

Please refer to the project scripts for training/inference entry points (for example `run_train_considgen.py` and `run_inference_considgen.py`) and adapt paths/configs to your environment.

## Citation

```bibtex
@misc{wu2026considgenviewconsistentidentitypreservingimagetovideo,
  title={ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation},
  author={Mingyang Wu and Ashirbad Mishra and Soumik Dey and Shuo Xing and Naveen Ravipati and Hansi Wu and Binbin Li and Zhengzhong Tu},
  year={2026},
  eprint={2602.10113},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2602.10113},
}
```