A3CarScene: An audio-visual dataset for driving scene understanding

Accurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor s...

Bibliographic Details
Main Authors: Michela Cantarini, Leonardo Gabrielli, Adriano Mancini, Stefano Squartini, Roberto Longo
Format: Article
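Language: English
Published: Elsevier 2023-06-01
Series: Data in Brief
Subjects: Acoustic and visual scene classification; Audio signal processing; Computer vision; Advanced driver assistance systems; Autonomous vehicles; Artificial neural networks
Online Access: http://www.sciencedirect.com/science/article/pii/S2352340923002652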
_version_ 1797798087390920704
author Michela Cantarini
Leonardo Gabrielli
Adriano Mancini
Stefano Squartini
Roberto Longo
author_sort Michela Cantarini
collection DOAJ
description Accurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor suite includes eight microphones installed inside and outside the passenger compartment and two dashcams mounted on the front and rear windows. Approximately 31 h of data for each device were collected during October and November 2022 by driving about 1500 km along diverse roads and landscapes, in variable weather conditions, in daytime and nighttime hours. All key information for the scene understanding process of automated vehicles has been accurately annotated. For each route, annotations with beginning and end timestamps report the type of road traveled (motorway, trunk, primary, secondary, tertiary, residential, and service roads), the degree of urbanization of the area (city, town, suburban area, village, exurban and rural areas), the weather conditions (clear, cloudy, overcast, and rainy), the level of lighting (daytime, evening, night, and tunnel), the type (asphalt or cobblestones) and moisture status (dry or wet) of the road pavement, and the state of the windows (open or closed). This large-scale dataset is valuable for developing new driving assistance technologies based on audio or video data alone or in a multimodal manner and for improving the performance of systems currently in use. The data acquisition process with sensors in multiple locations allows for the assessment of the best installation placement concerning the task. Deep learning engineers can use this dataset to build new baselines, as a comparative benchmark, and to extend existing databases for autonomous driving.
first_indexed 2024-03-13T03:58:11Z
format Article
id doaj.art-955b0e510f4346a99967a760424df521
institution Directory Open Access Journal
issn 2352-3409
language English
last_indexed 2024-03-13T03:58:11Z
publishDate 2023-06-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj.art-955b0e510f4346a99967a760424df521
2023-06-22T05:03:45Z
eng
Elsevier
Data in Brief
2352-3409
2023-06-01
48
109146
A3CarScene: An audio-visual dataset for driving scene understanding
Michela Cantarini [0]
Leonardo Gabrielli [1]
Adriano Mancini [2]
Stefano Squartini [3]
Roberto Longo [4]
[0] Department of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, Italy; Corresponding author.
[1] Department of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, Italy
[2] Department of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, Italy
[3] Department of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, Italy
[4] Groupe Signal Image et Instrumentation (GSII), École Supérieure d’Électronique de l'Ouest (ESEO), 10 Bd Jeanneteau, 49107 Angers, France; Laboratoire d'Acoustique de l'Université du Mans (LAUM), UMR 6613, Institut d'Acoustique - Graduate School (IA-GS), CNRS, Le Mans Université, Av. Olivier Messiaen, 72085 Le Mans, France
http://www.sciencedirect.com/science/article/pii/S2352340923002652
Acoustic and visual scene classification
Audio signal processing
Computer vision
Advanced driver assistance systems
Autonomous vehicles
Artificial neural networks
title A3CarScene: An audio-visual dataset for driving scene understanding
title_sort a3carscene an audio visual dataset for driving scene understanding
topic Acoustic and visual scene classification
Audio signal processing
Computer vision
Advanced driver assistance systems
Autonomous vehicles
Artificial neural networks
url http://www.sciencedirect.com/science/article/pii/S2352340923002652
work_keys_str_mv AT michelacantarini a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding
AT leonardogabrielli a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding
AT adrianomancini a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding
AT stefanosquartini a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding
AT robertolongo a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding
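
The description says each route carries annotations with beginning and end timestamps across seven label groups. The record does not specify how the dataset stores them, so the following is only a minimal Python sketch of one way such annotations could be represented and queried. The category keys, the `Annotation` class, the `labels_at` helper, and the sample route are all hypothetical; only the label values are taken from the description.

```python
from dataclasses import dataclass

# Label vocabularies as listed in the description.
# The dictionary keys (category names) are hypothetical, not from the dataset.
LABELS = {
    "road_type": {"motorway", "trunk", "primary", "secondary",
                  "tertiary", "residential", "service"},
    "urbanization": {"city", "town", "suburban", "village", "exurban", "rural"},
    "weather": {"clear", "cloudy", "overcast", "rainy"},
    "lighting": {"daytime", "evening", "night", "tunnel"},
    "pavement_type": {"asphalt", "cobblestones"},
    "pavement_moisture": {"dry", "wet"},
    "windows": {"open", "closed"},
}


@dataclass(frozen=True)
class Annotation:
    """One labeled time span on a route (times in seconds, an assumed unit)."""
    category: str
    label: str
    start_s: float
    end_s: float

    def __post_init__(self):
        # Reject categories and labels outside the vocabularies above.
        if self.category not in LABELS:
            raise ValueError(f"unknown category: {self.category}")
        if self.label not in LABELS[self.category]:
            raise ValueError(f"unknown {self.category} label: {self.label}")
        if self.end_s <= self.start_s:
            raise ValueError("end timestamp must be after start timestamp")


def labels_at(annotations, t_s):
    """Return {category: label} for every span [start_s, end_s) containing t_s."""
    return {a.category: a.label
            for a in annotations
            if a.start_s <= t_s < a.end_s}


# Made-up example route, for illustration only.
route = [
    Annotation("road_type", "primary", 0.0, 120.0),
    Annotation("road_type", "motorway", 120.0, 600.0),
    Annotation("weather", "rainy", 0.0, 600.0),
    Annotation("pavement_moisture", "wet", 0.0, 600.0),
]

print(labels_at(route, 150.0))
```

Treating each span as half-open keeps adjacent segments of the same category from overlapping at their shared boundary; whether the dataset follows that convention is not stated in this record.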