Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems
Deep learning has been widely used in computer vision applications and it has been shown to achieve state-of-the-art results in many applications including self-driving cars. Despite the great progress, less attention has been paid to the safety-level importance of different classes and the majority...
Main Authors: | Seyed-Hamid Mousavi, Mahdi Seyednezhad, Kin-Choong Yow |
---|---|
Format: | Article |
Language: | English |
Published: | IEEE 2023-01-01 |
Series: | IEEE Access |
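Subjects: | Semantic segmentation; efficient self-attention; cosine similarity score function; efficient attention-convolution UNet |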
Online Access: | https://ieeexplore.ieee.org/document/10285079/ |
_version_ | 1797376328463286272 |
---|---|
author | Seyed-Hamid Mousavi; Mahdi Seyednezhad; Kin-Choong Yow |
author_sort | Seyed-Hamid Mousavi |
collection | DOAJ |
description | Deep learning is widely used in computer vision and has been shown to achieve state-of-the-art results in many applications, including self-driving cars. Despite this progress, less attention has been paid to the safety-level importance of different classes: the majority of models treat all classes alike and consider only average precision. However, different classes contribute differently to the reliability and safety of an autonomous driving system; for example, the Person class should have higher priority than the Sky class in terms of segmentation accuracy. In this work, we introduce a new Attention-Convolution Block (ACB) feature extractor with modified self-attention, which extracts detailed and long-range information from the input feature maps and feeds the entire network with more focused feature maps. Based on this feature extractor, we developed two semantic segmentation models that balance complexity and accuracy and can accurately distinguish important classes, such as the Person class. To demonstrate the performance of our models, we ran experiments on the Cityscapes dataset, used both quantitative (mean and per-class IoU score) and qualitative (visual representation of output segmentation maps) measures, and compared the results of our model with state-of-the-art methods. The results show that our proposed model improves per-class IoU scores for the Person and Bike classes by at least 7 percent. In addition, we compared the accuracy of different models against their complexity to show that, despite its simple structure and low parameter count, our proposed model achieves high IoU accuracy. |
first_indexed | 2024-03-08T19:36:58Z |
format | Article |
id | doaj.art-0ae933d1a70a46e3bb1b415f811631e8 |
institution | Directory of Open Access Journals |
issn | 2169-3536 |
language | English |
last_indexed | 2024-03-08T19:36:58Z |
publishDate | 2023-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-0ae933d1a70a46e3bb1b415f811631e8; 2023-12-26T00:07:41Z; eng; IEEE; IEEE Access; ISSN 2169-3536; 2023-01-01; vol. 11, pp. 142146-142161; DOI 10.1109/ACCESS.2023.3324600; article 10285079; Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems; Seyed-Hamid Mousavi (0), Mahdi Seyednezhad (1), Kin-Choong Yow (2); https://orcid.org/0000-0002-8610-661X; (0) Faculty of Engineering and Applied Sciences, University of Regina, Regina, Canada; (1) Department of Computer Engineering and Sciences, Florida Institute of Technology, Melbourne, FL, USA; (2) Faculty of Engineering and Applied Sciences, University of Regina, Regina, Canada; https://ieeexplore.ieee.org/document/10285079/; Semantic segmentation; efficient self-attention; cosine similarity score function; efficient attention-convolution UNet |
title | Efficient Attention-Convolution Feature Extractor in Semantic Segmentation for Autonomous Driving Systems |
topic | Semantic segmentation; efficient self-attention; cosine similarity score function; efficient attention-convolution UNet |
url | https://ieeexplore.ieee.org/document/10285079/ |
work_keys_str_mv | AT seyedhamidmousavi efficientattentionconvolutionfeatureextractorinsemanticsegmentationforautonomousdrivingsystems AT mahdiseyednezhad efficientattentionconvolutionfeatureextractorinsemanticsegmentationforautonomousdrivingsystems AT kinchoongyow efficientattentionconvolutionfeatureextractorinsemanticsegmentationforautonomousdrivingsystems |
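
The description above reports results as mean and per-class IoU scores. As a reading aid, here is a minimal sketch of how those two quantities are commonly computed from flattened label maps. It is not the authors' evaluation code: the function names, the toy label ids (0 = Sky, 1 = Person, 2 = Bike), and the choice to skip classes absent from both maps are all assumptions made for illustration.

```python
def per_class_iou(pred, target, num_classes):
    """Return one IoU score per class; None when a class appears in neither map."""
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if p == t:
            # Pixel is correct: counts once toward intersection and union.
            intersection[p] += 1
            union[p] += 1
        else:
            # Pixel is wrong: it enlarges the union of both classes involved.
            union[p] += 1
            union[t] += 1
    return [i / u if u else None for i, u in zip(intersection, union)]


def mean_iou(scores):
    """Average the per-class scores, skipping classes that never occur."""
    present = [s for s in scores if s is not None]
    return sum(present) / len(present)


# Hypothetical toy data: flattened prediction and ground-truth label maps.
pred = [0, 0, 1, 1, 2, 0]
target = [0, 0, 1, 2, 2, 0]

scores = per_class_iou(pred, target, num_classes=3)
print(scores)            # per-class IoU for Sky, Person, Bike
print(mean_iou(scores))  # mean IoU over the classes that occur
```

Reporting the per-class list alongside the mean is what makes a safety-relevant class such as Person visible: a high mean can hide a low score on a class that covers few pixels.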