metadata
license: cc-by-nc-sa-4.0
pipeline_tag: image-to-video

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

WorldCanvas is an image-to-video (I2V) framework for promptable world events. It enables rich, user-directed simulation by combining text, trajectories, and reference images. It generates coherent, controllable events, including multi-agent interactions, object entry and exit, reference-guided appearance, and counterintuitive events.

WorldCanvas Demo

Setup and Inference

For setup, checkpoint download, and full inference instructions (with or without reference images), see the guide in the official GitHub repository. The repository provides command-line steps and Gradio interfaces for generating conditions and videos.

Citation

If you find this work useful, please consider citing our paper:

@article{wang2025worldcanvas,
  title={The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text},
  author={Hanlin Wang and Hao Ouyang and Qiuyu Wang and Yue Yu and Yihao Meng and Wen Wang and Ka Leong Cheng and Shuailei Ma and Qingyan Bai and Yixuan Li and Cheng Chen and Yanhong Zeng and Xing Zhu and Yujun Shen and Qifeng Chen},
  journal={arXiv preprint arXiv:2512.16924},
  year={2025}
}