arxiv:2305.15084

Audio-Visual Dataset and Method for Anomaly Detection in Traffic Videos

Published on May 24, 2023

Abstract

MAVAD, a novel audio-visual dataset for traffic anomaly detection, is introduced alongside AVACA, a cross-attention-based method that integrates visual and audio features. Adding audio input improves performance, and image anonymization causes only minimal degradation.

AI-generated summary

We introduce MAVAD, the first audio-visual dataset for traffic anomaly detection taken from real-world scenes, covering a diverse range of weather and illumination conditions. In addition, we propose a novel method named AVACA that combines visual and audio features extracted from video sequences by means of cross-attention to detect anomalies. We demonstrate that the addition of audio improves the performance of AVACA by up to 5.2%. We also evaluate the impact of image anonymization, showing only a minor decrease in performance, averaging 1.7%.
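To make the idea of fusing the two modalities by cross-attention concrete, here is a minimal sketch of generic scaled dot-product cross-attention in plain Python, in which visual features act as queries and audio features act as keys and values. This is an illustration only and is not taken from the paper: the function names, the toy feature vectors, and the choice of which modality supplies the queries are all assumptions, and the actual AVACA architecture may differ substantially (for example in learned projections, multiple heads, or the direction of attention).

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Generic scaled dot-product cross-attention (hypothetical helper).

    queries: list of query vectors (here: visual features), each of size d
    keys:    list of key vectors (here: audio features), each of size d
    values:  list of value vectors (here: audio features), one per key
    Returns the attended features (one per query) and the attention weights.
    """
    d = len(queries[0])
    scale = 1.0 / math.sqrt(d)
    weights = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        weights.append(softmax(scores))
    # Each output is a weighted average of the value vectors.
    dim_v = len(values[0])
    attended = [
        [sum(w * v[j] for w, v in zip(row, values)) for j in range(dim_v)]
        for row in weights
    ]
    return attended, weights


# Toy inputs (assumed, not real features): 3 visual steps, 2 audio steps.
visual = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
audio = [[1.0, 0.0], [0.0, 1.0]]

attended, weights = cross_attention(visual, audio, audio)
```

Each row of `weights` sums to 1 and indicates how strongly a given visual time step attends to each audio time step. In a real model the inputs would first pass through learned linear projections, which this sketch omits.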
