Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes
Paper: https://arxiv.org/abs/2506.00227
```python
import torch
from diffusers import DiffusionPipeline

# Switch to "mps" for Apple devices
pipe = DiffusionPipeline.from_pretrained(
    "AnthonyGosselin/Ctrl-Crash",
    torch_dtype=torch.bfloat16,
    device_map="cuda",
)

# Placeholder prompt from the Hub's auto-generated snippet; replace with your own
prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]
```

Ctrl-Crash generates car crash videos from an initial frame, using bounding-box and crash-type control signals.
This model uses Stability AI's Stable Video Diffusion image-to-video model (SVD 1.1) as its base: https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt-1-1
Ctrl-Crash supports several task settings, each enabled by varying which control signals are provided (e.g., the initial frame, per-frame bounding boxes, and the crash type).
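As a rough illustration of what these control signals look like as inputs, the sketch below rasterizes per-frame bounding boxes into a binary mask video and selects a discrete crash-type id. The shapes, the `rasterize_boxes` helper, and the crash-type labels are assumptions for illustration, not the model's actual API:

```python
import numpy as np

# Hypothetical control-signal preparation (shapes and label names are assumed,
# not taken from the Ctrl-Crash codebase).
NUM_FRAMES, H, W = 25, 320, 512
CRASH_TYPES = ["none", "ego-only", "ego/vehicle", "vehicle/vehicle"]  # assumed labels

def rasterize_boxes(boxes_per_frame, h=H, w=W):
    """Turn normalized (x0, y0, x1, y1) boxes into a binary mask video."""
    masks = np.zeros((len(boxes_per_frame), h, w), dtype=np.float32)
    for t, boxes in enumerate(boxes_per_frame):
        for x0, y0, x1, y1 in boxes:
            masks[t, int(y0 * h):int(y1 * h), int(x0 * w):int(x1 * w)] = 1.0
    return masks

# One box drifting toward the ego vehicle over the clip
boxes = [[(0.4 + 0.01 * t, 0.5, 0.5 + 0.01 * t, 0.7)] for t in range(NUM_FRAMES)]
bbox_cond = rasterize_boxes(boxes)          # (25, 320, 512) mask video
crash_type_id = CRASH_TYPES.index("ego/vehicle")
```

Varying which of these signals is supplied (boxes only, crash type only, both, or neither) is what switches between the task settings described above.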
Despite its strong performance, our approach has several limitations, which motivate future work in this direction.
BibTeX:
```bibtex
@misc{gosselin2025ctrlcrashcontrollablediffusionrealistic,
  title={Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes},
  author={Anthony Gosselin and Ge Ya Luo and Luis Lara and Florian Golemo and Derek Nowrouzezahrai and Liam Paull and Alexia Jolicoeur-Martineau and Christopher Pal},
  year={2025},
  eprint={2506.00227},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2506.00227},
}
```