3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation

In autonomous driving, LiDAR (light detection and ranging) data are acquired over time. Most existing 3D object detection algorithms propose the object bounding box by processing each frame of data independently, which ignores the temporal sequence information. However, the temporal sequence informa...

Full description

Bibliographic Details
Main Authors:	Guangda Xie, Yang Li, Yanping Wang, Ziyi Li, Hongquan Qu
Format:	Article
Language:	English
Published:	MDPI AG 2023-06-01
Series:	Remote Sensing
Subjects:	point cloud 3D object detection GRU positioning uncertainty
Online Access:	https://www.mdpi.com/2072-4292/15/12/2986

_version_	1797592818245435392
author	Guangda Xie Yang Li Yanping Wang Ziyi Li Hongquan Qu
author_facet	Guangda Xie Yang Li Yanping Wang Ziyi Li Hongquan Qu
author_sort	Guangda Xie
collection	DOAJ
description	In autonomous driving, LiDAR (light detection and ranging) data are acquired over time. Most existing 3D object detection algorithms propose the object bounding box by processing each frame of data independently, which ignores the temporal sequence information. However, the temporal sequence information is usually helpful to detect the object with missing shape information due to long distance or occlusion. To address this problem, we propose a temporal sequence information fusion 3D point cloud object detection algorithm based on the Ada-GRU (adaptive gated recurrent unit). In this method, the feature of each frame for the LiDAR point cloud is extracted through the backbone network and is fed to the Ada-GRU together with the hidden features of the previous frames. Compared to the traditional GRU, the Ada-GRU can adjust the gating mechanism adaptively during the training process by introducing the adaptive activation function. The Ada-GRU outputs the temporal sequence fusion features to predict the 3D object in the current frame and transmits the hidden features of the current frame to the next frame. At the same time, the label uncertainty of the distant and occluded objects affects the training effect of the model. For this problem, this paper proposes a probability distribution model of 3D bounding box coordinates based on the Gaussian distribution function and designs the corresponding bounding box loss function to enable the model to learn and estimate the uncertainty of the positioning of the bounding box coordinates, so as to remove the bounding box with large positioning uncertainty in the post-processing stage to reduce the false positive rate. Finally, the experiments show that the methods proposed in this paper improve the accuracy of the object detection without significantly increasing the complexity of the algorithm.
first_indexed	2024-03-11T01:59:29Z
format	Article
id	doaj.art-f7b0df136aea4d1baa1b32adf1a463c4
institution	Directory Open Access Journal
issn	2072-4292
language	English
last_indexed	2024-03-11T01:59:29Z
publishDate	2023-06-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj.art-f7b0df136aea4d1baa1b32adf1a463c42023-11-18T12:24:43ZengMDPI AGRemote Sensing2072-42922023-06-011512298610.3390/rs151229863D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty EstimationGuangda Xie0Yang Li1Yanping Wang2Ziyi Li3Hongquan Qu4College of Electrical and Control Engineering, North China University of Technology, Beijing 100144, ChinaCollege of Information, North China University of Technology, Beijing 100144, ChinaCollege of Information, North China University of Technology, Beijing 100144, ChinaCollege of Electrical and Control Engineering, North China University of Technology, Beijing 100144, ChinaCollege of Information, North China University of Technology, Beijing 100144, ChinaIn autonomous driving, LiDAR (light detection and ranging) data are acquired over time. Most existing 3D object detection algorithms propose the object bounding box by processing each frame of data independently, which ignores the temporal sequence information. However, the temporal sequence information is usually helpful to detect the object with missing shape information due to long distance or occlusion. To address this problem, we propose a temporal sequence information fusion 3D point cloud object detection algorithm based on the Ada-GRU (adaptive gated recurrent unit). In this method, the feature of each frame for the LiDAR point cloud is extracted through the backbone network and is fed to the Ada-GRU together with the hidden features of the previous frames. Compared to the traditional GRU, the Ada-GRU can adjust the gating mechanism adaptively during the training process by introducing the adaptive activation function. The Ada-GRU outputs the temporal sequence fusion features to predict the 3D object in the current frame and transmits the hidden features of the current frame to the next frame. At the same time, the label uncertainty of the distant and occluded objects affects the training effect of the model. For this problem, this paper proposes a probability distribution model of 3D bounding box coordinates based on the Gaussian distribution function and designs the corresponding bounding box loss function to enable the model to learn and estimate the uncertainty of the positioning of the bounding box coordinates, so as to remove the bounding box with large positioning uncertainty in the post-processing stage to reduce the false positive rate. Finally, the experiments show that the methods proposed in this paper improve the accuracy of the object detection without significantly increasing the complexity of the algorithm.https://www.mdpi.com/2072-4292/15/12/2986point cloud3D object detectionGRUpositioning uncertainty
spellingShingle	Guangda Xie Yang Li Yanping Wang Ziyi Li Hongquan Qu 3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation Remote Sensing point cloud 3D object detection GRU positioning uncertainty
title	3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation
title_full	3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation
title_fullStr	3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation
title_full_unstemmed	3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation
title_short	3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation
title_sort	3d point cloud object detection algorithm based on temporal information fusion and uncertainty estimation
topic	point cloud 3D object detection GRU positioning uncertainty
url	https://www.mdpi.com/2072-4292/15/12/2986
work_keys_str_mv	AT guangdaxie 3dpointcloudobjectdetectionalgorithmbasedontemporalinformationfusionanduncertaintyestimation AT yangli 3dpointcloudobjectdetectionalgorithmbasedontemporalinformationfusionanduncertaintyestimation AT yanpingwang 3dpointcloudobjectdetectionalgorithmbasedontemporalinformationfusionanduncertaintyestimation AT ziyili 3dpointcloudobjectdetectionalgorithmbasedontemporalinformationfusionanduncertaintyestimation AT hongquanqu 3dpointcloudobjectdetectionalgorithmbasedontemporalinformationfusionanduncertaintyestimation

3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation

Similar Items