πŸŽ₯ MemCam πŸŽ₯

Memory-Augmented Camera Control for Consistent Video Generation

IJCNN 2026 | Paper | Project Page | GitHub

Model Description

MemCam is a memory-augmented framework for scene-consistent interactive video generation, built on Wan2.1-T2V-1.3B. It treats previously generated frames as external memory and dynamically retrieves the most viewpoint-relevant frames via co-visibility estimation, enabling faithful scene reconstruction even after 360Β° camera rotations.

Files

File Description
dit_step20000.ckpt MemCam DiT checkpoint (trained 20k steps)

Usage

git clone https://github.com/newhorizon2005/MemCam.git
cd MemCam

# Download this model
huggingface-cli download newhorizon2005/MemCam dit_step20000.ckpt --local-dir models/MemCam

# Run inference
python inference_memcam.py

Citation

@inproceedings{gao2026memcam,
  title     = {MemCam: Memory-Augmented Camera Control for Consistent Video Generation},
  author    = {Gao, Xinhang and Guan, Junlin and Luo, Shuhan and Li, Wenzhuo and Tan, Guanghuan and Wang, Jiacheng},
  booktitle = {International Joint Conference on Neural Networks (IJCNN)},
  year      = {2026}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for newhorizon2005/MemCam

Finetuned
(35)
this model

Dataset used to train newhorizon2005/MemCam