Point cloud semantic segmentation based on local feature fusion and multilayer attention network

Abstract Semantic segmentation from a three‐dimensional point cloud is vital in autonomous driving, computer vision, and augmented reality. However, current semantic segmentation does not effectively use the point cloud's local geometric features and contextual information, essential for improv...


Bibliographic Details
Main Authors:	Junjie Wen, Jie Ma, Yuehua Zhao, Tong Nie, Mengxuan Sun, Ziming Fan
Format:	Article
Language:	English
Published:	Wiley 2024-04-01
Series:	IET Computer Vision
Subjects:	computer vision, image segmentation, pattern recognition
Online Access:	https://doi.org/10.1049/cvi2.12255

author	Junjie Wen, Jie Ma, Yuehua Zhao, Tong Nie, Mengxuan Sun, Ziming Fan
collection	DOAJ
description	Abstract: Semantic segmentation from a three‐dimensional point cloud is vital in autonomous driving, computer vision, and augmented reality. However, current semantic segmentation methods do not effectively use the point cloud's local geometric features and contextual information, both of which are essential for improving segmentation accuracy. A semantic segmentation network that uses local feature fusion and a multilayer attention mechanism is proposed to address these challenges. Specifically, the authors designed a local feature fusion module that encodes the geometric and feature information separately, which fully leverages the point cloud's feature perception and geometric structure representation. Furthermore, the authors designed a multilayer attention pooling module, consisting of local attention pooling and cascade attention pooling, to extract contextual information. Local attention pooling learns local neighbourhood information, and cascade attention pooling captures contextual information from deeper local neighbourhoods. Finally, an enhanced feature representation of important information is obtained by aggregating the features from the two deep attention pooling methods. Extensive experiments on the large‐scale point‐cloud datasets Stanford 3D Large‐Scale Indoor Spaces (S3DIS) and SemanticKITTI indicate that the authors' network shows clear advantages over existing representative methods in local geometric feature description and global contextual relationships.
first_indexed	2024-04-24T10:05:36Z
format	Article
id	doaj.art-beaa3c9ae24f4ebfbf3c162cd9cb9eb8
institution	Directory Open Access Journal
issn	1751-9632 1751-9640
language	English
last_indexed	2024-04-24T10:05:36Z
publishDate	2024-04-01
publisher	Wiley
record_format	Article
series	IET Computer Vision
spelling	doaj.art-beaa3c9ae24f4ebfbf3c162cd9cb9eb8 | 2024-04-13T04:15:00Z | eng | Wiley | IET Computer Vision | 1751-9632, 1751-9640 | 2024-04-01 | vol. 18, no. 3, pp. 381-392 | 10.1049/cvi2.12255 | Point cloud semantic segmentation based on local feature fusion and multilayer attention network | Junjie Wen, Jie Ma, Yuehua Zhao, Tong Nie, Mengxuan Sun, Ziming Fan (all: School of Electronics and Information Engineering, Hebei University of Technology, Tianjin, China) | https://doi.org/10.1049/cvi2.12255 | computer vision; image segmentation; pattern recognition
title	Point cloud semantic segmentation based on local feature fusion and multilayer attention network
topic	computer vision, image segmentation, pattern recognition
url	https://doi.org/10.1049/cvi2.12255


Similar Items