Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems

Deep learning has been widely used in computer vision applications and it has been shown to achieve state-of-the-art results in many applications including self-driving cars. Despite the great progress, less attention has been paid to the safety-level importance of different classes and the majority...

Full description

Bibliographic Details
Main Authors: Seyed-Hamid Mousavi, Mahdi Seyednezhad, Kin-Choong Yow
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10285079/
_version_ 1797376328463286272
author Seyed-Hamid Mousavi
Mahdi Seyednezhad
Kin-Choong Yow
author_facet Seyed-Hamid Mousavi
Mahdi Seyednezhad
Kin-Choong Yow
author_sort Seyed-Hamid Mousavi
collection DOAJ
description Deep learning has been widely used in computer vision applications and it has been shown to achieve state-of-the-art results in many applications including self-driving cars. Despite the great progress, less attention has been paid to the safety-level importance of different classes and the majority of the models treat all classes similarly, and only average precision is considered. However, different classes contribute differently to the reliability and safety level of an autonomous driving system i.e. the Person class should be of higher priority than the Sky class in terms of segmentation accuracy. So, in this work, we introduced a new Attention-Convolution Block (ACB) feature extractor with modified self-attention, which can extract detailed and long-range information from the input feature maps and feed the entire network with more focused feature maps. Based on this feature extractor, we developed two models for semantic segmentation that have a balanced trade-off between complexity and accuracy and can accurately distinguish important classes, like the Person class. To demonstrate the performance of our models, we ran our experiments on Cityscapes datasets, and used both quantitative (mean and per-class IoU score) and qualitative (visual representation of output segmentation maps) measures and compared the results of our model with the state-of-the-art methods. The results show that our proposed model improves Per-class IoU scores for Person and Bike classes at least by 7 percent. In addition, we compared the accuracy of different models against their complexity to show despite simple structure and low parameter number, our proposed model has high IoU accuracy.
first_indexed 2024-03-08T19:36:58Z
format Article
id doaj.art-0ae933d1a70a46e3bb1b415f811631e8
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-08T19:36:58Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-0ae933d1a70a46e3bb1b415f811631e82023-12-26T00:07:41ZengIEEEIEEE Access2169-35362023-01-011114214614216110.1109/ACCESS.2023.332460010285079Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving SystemsSeyed-Hamid Mousavi0Mahdi Seyednezhad1Kin-Choong Yow2https://orcid.org/0000-0002-8610-661XFaculty of Engineering and Applied Sciences, University of Regina, Regina, CanadaDepartment of Computer Engineering and Sciences, Florida Institute of Technology, Melbourne, FL, USAFaculty of Engineering and Applied Sciences, University of Regina, Regina, CanadaDeep learning has been widely used in computer vision applications and it has been shown to achieve state-of-the-art results in many applications including self-driving cars. Despite the great progress, less attention has been paid to the safety-level importance of different classes and the majority of the models treat all classes similarly, and only average precision is considered. However, different classes contribute differently to the reliability and safety level of an autonomous driving system i.e. the Person class should be of higher priority than the Sky class in terms of segmentation accuracy. So, in this work, we introduced a new Attention-Convolution Block (ACB) feature extractor with modified self-attention, which can extract detailed and long-range information from the input feature maps and feed the entire network with more focused feature maps. Based on this feature extractor, we developed two models for semantic segmentation that have a balanced trade-off between complexity and accuracy and can accurately distinguish important classes, like the Person class. To demonstrate the performance of our models, we ran our experiments on Cityscapes datasets, and used both quantitative (mean and per-class IoU score) and qualitative (visual representation of output segmentation maps) measures and compared the results of our model with the state-of-the-art methods. The results show that our proposed model improves Per-class IoU scores for Person and Bike classes at least by 7 percent. In addition, we compared the accuracy of different models against their complexity to show despite simple structure and low parameter number, our proposed model has high IoU accuracy.https://ieeexplore.ieee.org/document/10285079/Semantic segmentationefficient self-attentioncosine similarity score functionefficient attention-convolution UNet
spellingShingle Seyed-Hamid Mousavi
Mahdi Seyednezhad
Kin-Choong Yow
Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
IEEE Access
Semantic segmentation
efficient self-attention
cosine similarity score function
efficient attention-convolution UNet
title Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
title_full Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
title_fullStr Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
title_full_unstemmed Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
title_short Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
title_sort efficient attention convolution feature extractor in semantic segmentation for autonomous driving systems
topic Semantic segmentation
efficient self-attention
cosine similarity score function
efficient attention-convolution UNet
url https://ieeexplore.ieee.org/document/10285079/
work_keys_str_mv AT seyedhamidmousavi efficientattentionconvolutionfeatureextractorinsemanticsegmentationforautonomousdrivingsystems
AT mahdiseyednezhad efficientattentionconvolutionfeatureextractorinsemanticsegmentationforautonomousdrivingsystems
AT kinchoongyow efficientattentionconvolutionfeatureextractorinsemanticsegmentationforautonomousdrivingsystems