Mask-SL RCNN: Feature-Enhanced 3D Object Detection Network for Point Clouds

At present, with the original point cloud as input, most of the object detectors use Pointnet++ to extract features of the point cloud based on the Farthest Point Sampling (FPS). However, affected by FPS, feature extraction is incomplete and unstable. Moreover, high-level semantic features lack the...

Full description

Bibliographic Details
Main Authors: Yuanhong Zhong, Guangxia Yang, Dihang Deng, Panliang Tang, Fan Ren
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Photonics Journal
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10266671/
Description
Summary:At present, with the original point cloud as input, most of the object detectors use Pointnet++ to extract features of the point cloud based on the Farthest Point Sampling (FPS). However, affected by FPS, feature extraction is incomplete and unstable. Moreover, high-level semantic features lack the internal vertex properties of Regions of Interest (RoI). In order to solve the above problems, we propose the Mask-SL RCNN (Mask-Spherical-neighborhood-global-feature-Layer Region-CNN), a feature-enhanced 3D object detection network. It improves sampling of the farthest point through point-level feature enhancement. In addition, we propose Spherical neighborhood global feature Layer (SL) to supplement the global features and improve the learning ability of network. At last, based on semantic-level feature enhancement, we design grid pooling layer based on vertex attention, which fully explores the boundary characteristics of RoI and increases ability to learn advanced features in RoI. Our network improves detection precision of small objects such as pedestrians. Compared with PointRCNN, it has improved the mAP of simple, medium, and difficult object detection in the KITTI dataset by 2.66%, 1.69%, and 0.67%, respectively.
ISSN:1943-0655