TartanAviation: Image, Speech, and ADS-B Trajectory Datasets for Terminal Airspace Operations
Abstract
TartanAviation is an open-source multi-modal dataset combining images, speech, and ADS-B trajectory data from airport environments, designed to support AI integration in air traffic control and autonomous aircraft development.
We introduce TartanAviation, an open-source multi-modal dataset focused on terminal-area airspace operations. TartanAviation provides a holistic view of the airport environment by concurrently collecting image, speech, and ADS-B trajectory data using setups installed inside airport boundaries. The datasets were collected at both towered and non-towered airfields across multiple months to capture diversity in aircraft operations, seasons, aircraft types, and weather conditions. In total, TartanAviation provides 3.1M images, 3374 hours of Air Traffic Control speech data, and 661 days of ADS-B trajectory data. The data was filtered, processed, and validated to create a curated dataset. In addition to the dataset, we also open-source the code-base used to collect and pre-process the dataset, further enhancing accessibility and usability. We believe this dataset has many potential use cases and would be particularly vital in allowing AI and machine learning technologies to be integrated into air traffic control systems and advance the adoption of autonomous aircraft in the airspace.
Get this paper in your agent:
hf papers read 2403.03372 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 2
twangodev/tartanaviation-atc-adsb
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper