Semantic Segmentation and Depth Estimation with RGB and DVS Sensor Fusion for Multi-view Driving Perception

In this research, we present a novel deep multi-task learning model to handle the perception stage of an autonomous driving system. The model leverages the fusion of RGB and dynamic vision sensor (DVS) images to perform semantic segmentation and depth estimation in four different perspectives of vie...

Full description

Bibliographic Details
Main Authors: Natan, Oskar, Miura, Jun
Format: Other
Published: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2022
Subjects:
_version_ 1797037815113973760
author Natan, Oskar
Miura, Jun
author_facet Natan, Oskar
Miura, Jun
author_sort Natan, Oskar
collection UGM
description In this research, we present a novel deep multi-task learning model to handle the perception stage of an autonomous driving system. The model leverages the fusion of RGB and dynamic vision sensor (DVS) images to perform semantic segmentation and depth estimation in four different perspectives of view simultaneously. As for the experiment, CARLA simulator is used to generate thousands of simulation data for training, validation, and testing processes. A dynamically changing environment with various weather conditions, daytime, maps, and non-player characters (NPC) is also considered to simulate a more realistic condition with expecting a better generalization of the model. An ablation study is conducted by modifying the network architecture to evaluate the influence of the sensor fusion technique. Based on the test result on 2 different datasets, the model that leverages feature maps sharing from RGB and DVS encoders is performing better. Furthermore, we show that our model can inference faster and have a comparable performance against another recent model. Official implementation code is shared at https://github.com/oskarnatan/RGBDVS-fusion.
first_indexed 2024-03-14T00:09:43Z
format Other
id oai:generic.eprints.org:284221
institution Universiti Gadjah Mada
last_indexed 2024-03-14T00:09:43Z
publishDate 2022
publisher Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
record_format dspace
spelling oai:generic.eprints.org:2842212023-12-05T07:36:23Z https://repository.ugm.ac.id/284221/ Semantic Segmentation and Depth Estimation with RGB and DVS Sensor Fusion for Multi-view Driving Perception Natan, Oskar Miura, Jun Artificial Intelligence and Image Processing not elsewhere classified In this research, we present a novel deep multi-task learning model to handle the perception stage of an autonomous driving system. The model leverages the fusion of RGB and dynamic vision sensor (DVS) images to perform semantic segmentation and depth estimation in four different perspectives of view simultaneously. As for the experiment, CARLA simulator is used to generate thousands of simulation data for training, validation, and testing processes. A dynamically changing environment with various weather conditions, daytime, maps, and non-player characters (NPC) is also considered to simulate a more realistic condition with expecting a better generalization of the model. An ablation study is conducted by modifying the network architecture to evaluate the influence of the sensor fusion technique. Based on the test result on 2 different datasets, the model that leverages feature maps sharing from RGB and DVS encoders is performing better. Furthermore, we show that our model can inference faster and have a comparable performance against another recent model. Official implementation code is shared at https://github.com/oskarnatan/RGBDVS-fusion. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2022 Other NonPeerReviewed Natan, Oskar and Miura, Jun (2022) Semantic Segmentation and Depth Estimation with RGB and DVS Sensor Fusion for Multi-view Driving Perception. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). https://dl.acm.org/doi/10.1007/978-3-031-02375-0_26 10.1007/978-3-031-02375-0_26
spellingShingle Artificial Intelligence and Image Processing not elsewhere classified
Natan, Oskar
Miura, Jun
Semantic Segmentation and Depth Estimation with RGB and DVS Sensor Fusion for Multi-view Driving Perception
title Semantic Segmentation and Depth Estimation with RGB and DVS Sensor Fusion for Multi-view Driving Perception
title_full Semantic Segmentation and Depth Estimation with RGB and DVS Sensor Fusion for Multi-view Driving Perception
title_fullStr Semantic Segmentation and Depth Estimation with RGB and DVS Sensor Fusion for Multi-view Driving Perception
title_full_unstemmed Semantic Segmentation and Depth Estimation with RGB and DVS Sensor Fusion for Multi-view Driving Perception
title_short Semantic Segmentation and Depth Estimation with RGB and DVS Sensor Fusion for Multi-view Driving Perception
title_sort semantic segmentation and depth estimation with rgb and dvs sensor fusion for multi view driving perception
topic Artificial Intelligence and Image Processing not elsewhere classified
work_keys_str_mv AT natanoskar semanticsegmentationanddepthestimationwithrgbanddvssensorfusionformultiviewdrivingperception
AT miurajun semanticsegmentationanddepthestimationwithrgbanddvssensorfusionformultiviewdrivingperception