Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems

Deep learning has been widely used in computer vision applications and it has been shown to achieve state-of-the-art results in many applications including self-driving cars. Despite the great progress, less attention has been paid to the safety-level importance of different classes and the majority...

Full description

Bibliographic Details
Main Authors:	Seyed-Hamid Mousavi, Mahdi Seyednezhad, Kin-Choong Yow
Format:	Article
Language:	English
Published:	IEEE 2023-01-01
Series:	IEEE Access
Subjects:	Semantic segmentation efficient self-attention cosine similarity score function efficient attention-convolution UNet
Online Access:	https://ieeexplore.ieee.org/document/10285079/

_version_	1797376328463286272
author	Seyed-Hamid Mousavi Mahdi Seyednezhad Kin-Choong Yow
author_facet	Seyed-Hamid Mousavi Mahdi Seyednezhad Kin-Choong Yow
author_sort	Seyed-Hamid Mousavi
collection	DOAJ
description	Deep learning has been widely used in computer vision applications and it has been shown to achieve state-of-the-art results in many applications including self-driving cars. Despite the great progress, less attention has been paid to the safety-level importance of different classes and the majority of the models treat all classes similarly, and only average precision is considered. However, different classes contribute differently to the reliability and safety level of an autonomous driving system i.e. the Person class should be of higher priority than the Sky class in terms of segmentation accuracy. So, in this work, we introduced a new Attention-Convolution Block (ACB) feature extractor with modified self-attention, which can extract detailed and long-range information from the input feature maps and feed the entire network with more focused feature maps. Based on this feature extractor, we developed two models for semantic segmentation that have a balanced trade-off between complexity and accuracy and can accurately distinguish important classes, like the Person class. To demonstrate the performance of our models, we ran our experiments on Cityscapes datasets, and used both quantitative (mean and per-class IoU score) and qualitative (visual representation of output segmentation maps) measures and compared the results of our model with the state-of-the-art methods. The results show that our proposed model improves Per-class IoU scores for Person and Bike classes at least by 7 percent. In addition, we compared the accuracy of different models against their complexity to show despite simple structure and low parameter number, our proposed model has high IoU accuracy.
first_indexed	2024-03-08T19:36:58Z
format	Article
id	doaj.art-0ae933d1a70a46e3bb1b415f811631e8
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-03-08T19:36:58Z
publishDate	2023-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-0ae933d1a70a46e3bb1b415f811631e82023-12-26T00:07:41ZengIEEEIEEE Access2169-35362023-01-011114214614216110.1109/ACCESS.2023.332460010285079Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving SystemsSeyed-Hamid Mousavi0Mahdi Seyednezhad1Kin-Choong Yow2https://orcid.org/0000-0002-8610-661XFaculty of Engineering and Applied Sciences, University of Regina, Regina, CanadaDepartment of Computer Engineering and Sciences, Florida Institute of Technology, Melbourne, FL, USAFaculty of Engineering and Applied Sciences, University of Regina, Regina, CanadaDeep learning has been widely used in computer vision applications and it has been shown to achieve state-of-the-art results in many applications including self-driving cars. Despite the great progress, less attention has been paid to the safety-level importance of different classes and the majority of the models treat all classes similarly, and only average precision is considered. However, different classes contribute differently to the reliability and safety level of an autonomous driving system i.e. the Person class should be of higher priority than the Sky class in terms of segmentation accuracy. So, in this work, we introduced a new Attention-Convolution Block (ACB) feature extractor with modified self-attention, which can extract detailed and long-range information from the input feature maps and feed the entire network with more focused feature maps. Based on this feature extractor, we developed two models for semantic segmentation that have a balanced trade-off between complexity and accuracy and can accurately distinguish important classes, like the Person class. To demonstrate the performance of our models, we ran our experiments on Cityscapes datasets, and used both quantitative (mean and per-class IoU score) and qualitative (visual representation of output segmentation maps) measures and compared the results of our model with the state-of-the-art methods. The results show that our proposed model improves Per-class IoU scores for Person and Bike classes at least by 7 percent. In addition, we compared the accuracy of different models against their complexity to show despite simple structure and low parameter number, our proposed model has high IoU accuracy.https://ieeexplore.ieee.org/document/10285079/Semantic segmentationefficient self-attentioncosine similarity score functionefficient attention-convolution UNet
spellingShingle	Seyed-Hamid Mousavi Mahdi Seyednezhad Kin-Choong Yow Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems IEEE Access Semantic segmentation efficient self-attention cosine similarity score function efficient attention-convolution UNet
title	Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
title_full	Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
title_fullStr	Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
title_full_unstemmed	Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
title_short	Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
title_sort	efficient attention convolution feature extractor in semantic segmentation for autonomous driving systems
topic	Semantic segmentation efficient self-attention cosine similarity score function efficient attention-convolution UNet
url	https://ieeexplore.ieee.org/document/10285079/
work_keys_str_mv	AT seyedhamidmousavi efficientattentionconvolutionfeatureextractorinsemanticsegmentationforautonomousdrivingsystems AT mahdiseyednezhad efficientattentionconvolutionfeatureextractorinsemanticsegmentationforautonomousdrivingsystems AT kinchoongyow efficientattentionconvolutionfeatureextractorinsemanticsegmentationforautonomousdrivingsystems

Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems

Similar Items