A3CarScene: An audio-visual dataset for driving scene understanding

Accurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor s...

Full description

Bibliographic Details
Main Authors:	Michela Cantarini, Leonardo Gabrielli, Adriano Mancini, Stefano Squartini, Roberto Longo
Format:	Article
Language:	English
Published:	Elsevier 2023-06-01
Series:	Data in Brief
Subjects:	Acoustic and visual scene classification Audio signal processing Computer vision Advanced driver assistance systems Autonomous vehicles Artificial neural networks
Online Access:	http://www.sciencedirect.com/science/article/pii/S2352340923002652

_version_	1797798087390920704
author	Michela Cantarini Leonardo Gabrielli Adriano Mancini Stefano Squartini Roberto Longo
author_facet	Michela Cantarini Leonardo Gabrielli Adriano Mancini Stefano Squartini Roberto Longo
author_sort	Michela Cantarini
collection	DOAJ
description	Accurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor suite includes eight microphones installed inside and outside the passenger compartment and two dashcams mounted on the front and rear windows. Approximately 31 h of data for each device were collected during October and November 2022 by driving about 1500 km along diverse roads and landscapes, in variable weather conditions, in daytime and nighttime hours. All key information for the scene understanding process of automated vehicles has been accurately annotated. For each route, annotations with beginning and end timestamps report the type of road traveled (motorway, trunk, primary, secondary, tertiary, residential, and service roads), the degree of urbanization of the area (city, town, suburban area, village, exurban and rural areas), the weather conditions (clear, cloudy, overcast, and rainy), the level of lighting (daytime, evening, night, and tunnel), the type (asphalt or cobblestones) and moisture status (dry or wet) of the road pavement, and the state of the windows (open or closed).This large-scale dataset is valuable for developing new driving assistance technologies based on audio or video data alone or in a multimodal manner and for improving the performance of systems currently in use. The data acquisition process with sensors in multiple locations allows for the assessment of the best installation placement concerning the task. Deep learning engineers can use this dataset to build new baselines, as a comparative benchmark, and to extend existing databases for autonomous driving.
first_indexed	2024-03-13T03:58:11Z
format	Article
id	doaj.art-955b0e510f4346a99967a760424df521
institution	Directory Open Access Journal
issn	2352-3409
language	English
last_indexed	2024-03-13T03:58:11Z
publishDate	2023-06-01
publisher	Elsevier
record_format	Article
series	Data in Brief
spelling	doaj.art-955b0e510f4346a99967a760424df5212023-06-22T05:03:45ZengElsevierData in Brief2352-34092023-06-0148109146A3CarScene: An audio-visual dataset for driving scene understandingMichela Cantarini0Leonardo Gabrielli1Adriano Mancini2Stefano Squartini3Roberto Longo4Department of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, Italy; Corresponding author.Department of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, ItalyDepartment of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, ItalyDepartment of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, ItalyGroupe Signal Image et Instrumentation (GSII), École Supérieure d’Électronique de l'Ouest (ESEO), 10 Bd Jeanneteau, 49107 Angers, France; Laboratoire d'Acoustique de l'Université du Mans (LAUM), UMR 6613, Institut d'Acoustique - Graduate School (IA-GS), CNRS, Le Mans Université, Av. Olivier Messiaen, 72085 Le Mans, FranceAccurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor suite includes eight microphones installed inside and outside the passenger compartment and two dashcams mounted on the front and rear windows. Approximately 31 h of data for each device were collected during October and November 2022 by driving about 1500 km along diverse roads and landscapes, in variable weather conditions, in daytime and nighttime hours. All key information for the scene understanding process of automated vehicles has been accurately annotated. For each route, annotations with beginning and end timestamps report the type of road traveled (motorway, trunk, primary, secondary, tertiary, residential, and service roads), the degree of urbanization of the area (city, town, suburban area, village, exurban and rural areas), the weather conditions (clear, cloudy, overcast, and rainy), the level of lighting (daytime, evening, night, and tunnel), the type (asphalt or cobblestones) and moisture status (dry or wet) of the road pavement, and the state of the windows (open or closed).This large-scale dataset is valuable for developing new driving assistance technologies based on audio or video data alone or in a multimodal manner and for improving the performance of systems currently in use. The data acquisition process with sensors in multiple locations allows for the assessment of the best installation placement concerning the task. Deep learning engineers can use this dataset to build new baselines, as a comparative benchmark, and to extend existing databases for autonomous driving.http://www.sciencedirect.com/science/article/pii/S2352340923002652Acoustic and visual scene classificationAudio signal processingComputer visionAdvanced driver assistance systemsAutonomous vehiclesArtificial neural networks
spellingShingle	Michela Cantarini Leonardo Gabrielli Adriano Mancini Stefano Squartini Roberto Longo A3CarScene: An audio-visual dataset for driving scene understanding Data in Brief Acoustic and visual scene classification Audio signal processing Computer vision Advanced driver assistance systems Autonomous vehicles Artificial neural networks
title	A3CarScene: An audio-visual dataset for driving scene understanding
title_full	A3CarScene: An audio-visual dataset for driving scene understanding
title_fullStr	A3CarScene: An audio-visual dataset for driving scene understanding
title_full_unstemmed	A3CarScene: An audio-visual dataset for driving scene understanding
title_short	A3CarScene: An audio-visual dataset for driving scene understanding
title_sort	a3carscene an audio visual dataset for driving scene understanding
topic	Acoustic and visual scene classification Audio signal processing Computer vision Advanced driver assistance systems Autonomous vehicles Artificial neural networks
url	http://www.sciencedirect.com/science/article/pii/S2352340923002652
work_keys_str_mv	AT michelacantarini a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding AT leonardogabrielli a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding AT adrianomancini a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding AT stefanosquartini a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding AT robertolongo a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding

A3CarScene: An audio-visual dataset for driving scene understanding

Similar Items