Update README.md
Browse files
README.md
CHANGED
|
@@ -17,7 +17,9 @@ pipeline_tag: video-text-to-text
|
|
| 17 |
|
| 18 |
TRASER is the video scene graph generation model introduced in **Synthetic Visual Genome 2 (SVG2)**. Given a video and per-object segmentation trajectories, it generates a structured spatio-temporal scene graph describing objects, attributes, and their relations across time.
|
| 19 |
|
| 20 |
-
**Paper:** [Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos](https://
|
|
|
|
|
|
|
| 21 |
|
| 22 |
**Authors:** Ziqi Gao, Jieyu Zhang, Wisdom Oluchi Ikezogwo, Jae Sung Park, Tario G You, Daniel Ogbu, Chenhao Zheng, Weikai Huang, Yinuo Yang, Quan Kong, Rajat Saini, Ranjay Krishna. (Allen Institute for AI 路 University of Washington 路 Woven by Toyota)
|
| 23 |
|
|
@@ -162,9 +164,13 @@ TRASER is trained on [**SVG2**](https://huggingface.co/datasets/UWGZQ/Synthetic_
|
|
| 162 |
## Citation
|
| 163 |
|
| 164 |
```bibtex
|
| 165 |
-
@
|
| 166 |
-
|
| 167 |
-
|
| 168 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 169 |
}
|
| 170 |
```
|
|
|
|
| 17 |
|
| 18 |
TRASER is the video scene graph generation model introduced in **Synthetic Visual Genome 2 (SVG2)**. Given a video and per-object segmentation trajectories, it generates a structured spatio-temporal scene graph describing objects, attributes, and their relations across time.
|
| 19 |
|
| 20 |
+
**Paper:** [Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos](https://arxiv.org/pdf/2602.23543)
|
| 21 |
+
|
| 22 |
+
**Website:** [Synthetic Visual Genome 2](https://uwgzq.github.io/papers/SVG2/)
|
| 23 |
|
| 24 |
**Authors:** Ziqi Gao, Jieyu Zhang, Wisdom Oluchi Ikezogwo, Jae Sung Park, Tario G You, Daniel Ogbu, Chenhao Zheng, Weikai Huang, Yinuo Yang, Quan Kong, Rajat Saini, Ranjay Krishna. (Allen Institute for AI 路 University of Washington 路 Woven by Toyota)
|
| 25 |
|
|
|
|
| 164 |
## Citation
|
| 165 |
|
| 166 |
```bibtex
|
| 167 |
+
@misc{gao2026syntheticvisualgenome2,
|
| 168 |
+
title={Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos},
|
| 169 |
+
author={Ziqi Gao and Jieyu Zhang and Wisdom Oluchi Ikezogwo and Jae Sung Park and Tario G. You and Daniel Ogbu and Chenhao Zheng and Weikai Huang and Yinuo Yang and Winson Han and Quan Kong and Rajat Saini and Ranjay Krishna},
|
| 170 |
+
year={2026},
|
| 171 |
+
eprint={2602.23543},
|
| 172 |
+
archivePrefix={arXiv},
|
| 173 |
+
primaryClass={cs.CV},
|
| 174 |
+
url={https://arxiv.org/abs/2602.23543},
|
| 175 |
}
|
| 176 |
```
|