AnthonyGosselin commited on
Commit
eb41b3e
·
verified ·
1 Parent(s): 9ffc2fd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -5
README.md CHANGED
@@ -6,7 +6,7 @@ base_model:
6
 
7
  # Model Card for Ctrl-Crash
8
 
9
- Generate car crash videos from an initial frame and using bounding-box and crash type conditioning.
10
 
11
  <p align="center">
12
  <table cellspacing="0" cellpadding="0">
@@ -26,7 +26,7 @@ Generate car crash videos from an initial frame and using bounding-box and crash
26
  <img src="architecture_figure.png" width=800>
27
  </p>
28
 
29
- TODO: Provide a longer summary of what this model is.
30
 
31
  - Visit the **project page** for demos: https://anthonygosselin.github.io/Ctrl-Crash-ProjectPage/
32
  - Visit the **repository** to get started: https://github.com/CtrlCrash-Anonymous/Ctrl-Crash-Anonymous
@@ -39,13 +39,23 @@ This model uses the Stability AI Image-to-Video model (SVD 1.1) as a base model:
39
 
40
  <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
41
 
42
- TODO: Here we can describe the different operation modes (Reconstruction, Prediction and counterfactuals)
 
 
 
 
 
43
 
44
  ## Bias, Risks, and Limitations
45
 
46
- TODO: Limitations of model
 
 
 
 
 
47
 
48
- **BibTeX:**
49
  ```bibtex
50
  @misc{gosselin2025ctrlcrashcontrollablediffusionrealistic,
51
  title={Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes},
 
6
 
7
  # Model Card for Ctrl-Crash
8
 
9
+ Generate car crash videos from an initial frame, using bounding-box and crash type control signals.
10
 
11
  <p align="center">
12
  <table cellspacing="0" cellpadding="0">
 
26
  <img src="architecture_figure.png" width=800>
27
  </p>
28
 
29
+ <!-- TODO: Provide a longer summary of what this model is. -->
30
 
31
  - Visit the **project page** for demos: https://anthonygosselin.github.io/Ctrl-Crash-ProjectPage/
32
  - Visit the **repository** to get started: https://github.com/CtrlCrash-Anonymous/Ctrl-Crash-Anonymous
 
39
 
40
  <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
41
 
42
+ <!-- TODO: Here we can describe the different operation modes (Reconstruction, Prediction and counterfactuals) -->
43
+
44
+ Ctrl-Crash supports different task settings, each enabled by varying the available control signals, namely:
45
+ - **(1) Crash Reconstruction**: Given an initial image, full bounding box sequence, and a crash type, the model reconstructs a consistent video combining the visual context of the initial frame with agent motion derived from the bounding boxes.
46
+ - **(2) Crash Prediction**: Given the initial frame and only a few initial bounding box frames (e.g., 0–9), the model predicts the future motion of agents in a way that aligns with the target crash type.
47
+ - **(3) Crash Counterfactuals**: Extending the prediction task, this mode varies the crash type signal while keeping other inputs fixed, enabling the generation of multiple plausible outcomes for the same scene—supporting counterfactual safety reasoning.
48
 
49
  ## Bias, Risks, and Limitations
50
 
51
+ Despite its strong performance, our approach has several limitations, which motivates future work in this direction.
52
+
53
+ - Counterfactual outcomes can be hard to generate when initial scene conditions conflict with the desired crash type.
54
+ - The model relies heavily on bounding boxes, making it sensitive to tracking errors—especially in fully conditioned reconstruction.
55
+ - With no bounding boxes conditioning, motion direction can be ambiguous, and 2D boxes struggle to capture rotation or orientation, limiting realism in behaviors like spinouts
56
+ - Does not support text conditioning
57
 
58
+ **BibTeX:**
59
  ```bibtex
60
  @misc{gosselin2025ctrlcrashcontrollablediffusionrealistic,
61
  title={Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes},