---
language:
- en
license: apache-2.0
tags:
- image-to-video
- video-generation
- identity-preservation
- view-consistency
- diffusion
- consid-gen
pipeline_tag: image-to-video
library_name: diffsynth
---

# ConsID-Gen

**ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation**

Mingyang Wu, Ashirbad Mishra, Soumik Dey, Shuo Xing, Naveen Ravipati, Hansi Wu, Binbin Li, Zhengzhong Tu (2026)  
Accepted by **CVPR 2026**.

## Summary

This repository hosts the model checkpoint for the paper **ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation**.

Given an input image, ConsID-Gen aims to generate videos that:

- preserve the subject's identity,
- stay consistent across viewpoints,
- remain temporally coherent.

## Files

- `model.safetensors`: Main model checkpoint.
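
Before wiring the checkpoint into a pipeline, it can help to see which tensors it holds. The sketch below is not part of the project scripts. It reads only the JSON header of a `.safetensors` file with the Python standard library, relying on the published file layout (an 8-byte little-endian header length followed by a JSON header). The function names `read_safetensors_header` and `summarize` and the local path `model.safetensors` are assumptions made for illustration.

```python
import json
import struct
from pathlib import Path


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file without loading tensor data."""
    with Path(path).open("rb") as f:
        # First 8 bytes: little-endian unsigned 64-bit length of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    return header


def summarize(header):
    """Map each tensor name to (dtype, shape), skipping the optional metadata entry."""
    return {
        name: (info["dtype"], tuple(info["shape"]))
        for name, info in header.items()
        if name != "__metadata__"
    }


if __name__ == "__main__":
    # Hypothetical local path; point it at wherever the checkpoint was downloaded.
    ckpt = Path("model.safetensors")
    if ckpt.exists():
        for name, (dtype, shape) in summarize(read_safetensors_header(ckpt)).items():
            print(name, dtype, shape)
```

Because only the header is parsed, this runs quickly even on a large checkpoint and needs no deep learning framework installed. To load the actual weights, use the project's own scripts.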

## Usage

Training and inference entry points are in the project scripts (for example, `run_train_considgen.py` and `run_inference_considgen.py`). Adapt the paths and configs to your environment before running them.

## Citation

```bibtex
@misc{wu2026considgenviewconsistentidentitypreservingimagetovideo,
  title={ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation},
  author={Mingyang Wu and Ashirbad Mishra and Soumik Dey and Shuo Xing and Naveen Ravipati and Hansi Wu and Binbin Li and Zhengzhong Tu},
  year={2026},
  eprint={2602.10113},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2602.10113},
}
```