A3CarScene: An audio-visual dataset for driving scene understanding
Accurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor s...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2023-06-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340923002652 |
_version_ | 1797798087390920704 |
---|---|
author | Michela Cantarini Leonardo Gabrielli Adriano Mancini Stefano Squartini Roberto Longo |
author_facet | Michela Cantarini Leonardo Gabrielli Adriano Mancini Stefano Squartini Roberto Longo |
author_sort | Michela Cantarini |
collection | DOAJ |
description | Accurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor suite includes eight microphones installed inside and outside the passenger compartment and two dashcams mounted on the front and rear windows. Approximately 31 h of data for each device were collected during October and November 2022 by driving about 1500 km along diverse roads and landscapes, in variable weather conditions, in daytime and nighttime hours. All key information for the scene understanding process of automated vehicles has been accurately annotated. For each route, annotations with beginning and end timestamps report the type of road traveled (motorway, trunk, primary, secondary, tertiary, residential, and service roads), the degree of urbanization of the area (city, town, suburban area, village, exurban and rural areas), the weather conditions (clear, cloudy, overcast, and rainy), the level of lighting (daytime, evening, night, and tunnel), the type (asphalt or cobblestones) and moisture status (dry or wet) of the road pavement, and the state of the windows (open or closed).This large-scale dataset is valuable for developing new driving assistance technologies based on audio or video data alone or in a multimodal manner and for improving the performance of systems currently in use. The data acquisition process with sensors in multiple locations allows for the assessment of the best installation placement concerning the task. Deep learning engineers can use this dataset to build new baselines, as a comparative benchmark, and to extend existing databases for autonomous driving. |
first_indexed | 2024-03-13T03:58:11Z |
format | Article |
id | doaj.art-955b0e510f4346a99967a760424df521 |
institution | Directory Open Access Journal |
issn | 2352-3409 |
language | English |
last_indexed | 2024-03-13T03:58:11Z |
publishDate | 2023-06-01 |
publisher | Elsevier |
record_format | Article |
series | Data in Brief |
spelling | doaj.art-955b0e510f4346a99967a760424df5212023-06-22T05:03:45ZengElsevierData in Brief2352-34092023-06-0148109146A3CarScene: An audio-visual dataset for driving scene understandingMichela Cantarini0Leonardo Gabrielli1Adriano Mancini2Stefano Squartini3Roberto Longo4Department of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, Italy; Corresponding author.Department of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, ItalyDepartment of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, ItalyDepartment of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche 12, 60131 Ancona, ItalyGroupe Signal Image et Instrumentation (GSII), École Supérieure d’Électronique de l'Ouest (ESEO), 10 Bd Jeanneteau, 49107 Angers, France; Laboratoire d'Acoustique de l'Université du Mans (LAUM), UMR 6613, Institut d'Acoustique - Graduate School (IA-GS), CNRS, Le Mans Université, Av. Olivier Messiaen, 72085 Le Mans, FranceAccurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor suite includes eight microphones installed inside and outside the passenger compartment and two dashcams mounted on the front and rear windows. Approximately 31 h of data for each device were collected during October and November 2022 by driving about 1500 km along diverse roads and landscapes, in variable weather conditions, in daytime and nighttime hours. All key information for the scene understanding process of automated vehicles has been accurately annotated. For each route, annotations with beginning and end timestamps report the type of road traveled (motorway, trunk, primary, secondary, tertiary, residential, and service roads), the degree of urbanization of the area (city, town, suburban area, village, exurban and rural areas), the weather conditions (clear, cloudy, overcast, and rainy), the level of lighting (daytime, evening, night, and tunnel), the type (asphalt or cobblestones) and moisture status (dry or wet) of the road pavement, and the state of the windows (open or closed).This large-scale dataset is valuable for developing new driving assistance technologies based on audio or video data alone or in a multimodal manner and for improving the performance of systems currently in use. The data acquisition process with sensors in multiple locations allows for the assessment of the best installation placement concerning the task. Deep learning engineers can use this dataset to build new baselines, as a comparative benchmark, and to extend existing databases for autonomous driving.http://www.sciencedirect.com/science/article/pii/S2352340923002652Acoustic and visual scene classificationAudio signal processingComputer visionAdvanced driver assistance systemsAutonomous vehiclesArtificial neural networks |
spellingShingle | Michela Cantarini Leonardo Gabrielli Adriano Mancini Stefano Squartini Roberto Longo A3CarScene: An audio-visual dataset for driving scene understanding Data in Brief Acoustic and visual scene classification Audio signal processing Computer vision Advanced driver assistance systems Autonomous vehicles Artificial neural networks |
title | A3CarScene: An audio-visual dataset for driving scene understanding |
title_full | A3CarScene: An audio-visual dataset for driving scene understanding |
title_fullStr | A3CarScene: An audio-visual dataset for driving scene understanding |
title_full_unstemmed | A3CarScene: An audio-visual dataset for driving scene understanding |
title_short | A3CarScene: An audio-visual dataset for driving scene understanding |
title_sort | a3carscene an audio visual dataset for driving scene understanding |
topic | Acoustic and visual scene classification Audio signal processing Computer vision Advanced driver assistance systems Autonomous vehicles Artificial neural networks |
url | http://www.sciencedirect.com/science/article/pii/S2352340923002652 |
work_keys_str_mv | AT michelacantarini a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding AT leonardogabrielli a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding AT adrianomancini a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding AT stefanosquartini a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding AT robertolongo a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding |