FMAM-Net: Fusion Multi-Scale Attention Mechanism Network for Building Segmentation in Remote Sensing Images

As the largest target in remote sensing images, buildings have important application value in urban planning and old city reconstruction. However, most networks have poor recognition ability on high resolution images, resulting in blurred boundaries in the segmented building maps. Then, the similari...

Full description

Bibliographic Details
Main Authors: Huanran Ye, Run Zhou, Jianhao Wang, Zhiliang Huang
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9996387/
_version_ 1797966788442456064
author Huanran Ye
Run Zhou
Jianhao Wang
Zhiliang Huang
author_facet Huanran Ye
Run Zhou
Jianhao Wang
Zhiliang Huang
author_sort Huanran Ye
collection DOAJ
description As the largest target in remote sensing images, buildings have important application value in urban planning and old city reconstruction. However, most networks have poor recognition ability on high resolution images, resulting in blurred boundaries in the segmented building maps. Then, the similarity between buildings and backgrounds will lead to inter-class indistinction. Finally, the diversity of buildings brings difficulties to segmentation, which requires the network to have better generalization ability. To address these problems, we propose Fusion Multi-scale Attention Mechanism Network (FMAM-Net). Firstly, we design Feature Refine Compensation Module(FRCM) to improve the boundary ambiguity problem, including Feature Refinement Module(FRM) and Feature Compensation Module(FCM). FRM utilizes the densely connected architecture to refine features and increase recognition capabilities. FCM introduces low-level features to make up for the lack of boundary information in high-level features. Secondly, to handle inter-class indistinction, we design Tandem Attention Module(TAM) and Parallel Attention Module(PAM). TAM is designed to sequentially filter some features from channels and spaces for adaptive feature refinement. PAM combines context information and uses high-level features to guide low-level features to select more distinguishable features. Finally, based on the binary cross entropy loss function, we add an evaluation index to reduce the error caused by determining the optimization direction only through cross entropy. On the Inria Aerial Image Labeling Dataset, FMAM-Net achieves mean IoU of 85.34%, which is 5.58% higher than AMUNet and 3.77% higher than our baseline(U-Net ResNet-34). On the WHU Dataset, IoU reached the maximum value of 91.06% on FMAM-Net, 1.67% higher than SARB-UNet and 0.2% higher than MAP-Net. The visualization results show that FMAM-Net improves the fuzzy boundary of building segmentation and reduces the inter-class indistinction.
first_indexed 2024-04-11T02:20:53Z
format Article
id doaj.art-c7b36fbf18474251b6f05cf65eed91c7
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-11T02:20:53Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-c7b36fbf18474251b6f05cf65eed91c72023-01-03T00:00:11ZengIEEEIEEE Access2169-35362022-01-011013424113425110.1109/ACCESS.2022.32313629996387FMAM-Net: Fusion Multi-Scale Attention Mechanism Network for Building Segmentation in Remote Sensing ImagesHuanran Ye0https://orcid.org/0000-0001-6658-0368Run Zhou1Jianhao Wang2https://orcid.org/0000-0002-9782-3243Zhiliang Huang3https://orcid.org/0000-0003-0876-063XSchool of Mechanical and Electrical Information, Yiwu Industrial & Commercial College, Jinhua, ChinaSchool of Mechanical and Electrical Information, Yiwu Industrial & Commercial College, Jinhua, ChinaSchool of Mechanical and Electrical Information, Yiwu Industrial & Commercial College, Jinhua, ChinaCollege of Physics and Electronic Information Engineering, Zhejiang Normal University, Jinhua, ChinaAs the largest target in remote sensing images, buildings have important application value in urban planning and old city reconstruction. However, most networks have poor recognition ability on high resolution images, resulting in blurred boundaries in the segmented building maps. Then, the similarity between buildings and backgrounds will lead to inter-class indistinction. Finally, the diversity of buildings brings difficulties to segmentation, which requires the network to have better generalization ability. To address these problems, we propose Fusion Multi-scale Attention Mechanism Network (FMAM-Net). Firstly, we design Feature Refine Compensation Module(FRCM) to improve the boundary ambiguity problem, including Feature Refinement Module(FRM) and Feature Compensation Module(FCM). FRM utilizes the densely connected architecture to refine features and increase recognition capabilities. FCM introduces low-level features to make up for the lack of boundary information in high-level features. Secondly, to handle inter-class indistinction, we design Tandem Attention Module(TAM) and Parallel Attention Module(PAM). TAM is designed to sequentially filter some features from channels and spaces for adaptive feature refinement. PAM combines context information and uses high-level features to guide low-level features to select more distinguishable features. Finally, based on the binary cross entropy loss function, we add an evaluation index to reduce the error caused by determining the optimization direction only through cross entropy. On the Inria Aerial Image Labeling Dataset, FMAM-Net achieves mean IoU of 85.34%, which is 5.58% higher than AMUNet and 3.77% higher than our baseline(U-Net ResNet-34). On the WHU Dataset, IoU reached the maximum value of 91.06% on FMAM-Net, 1.67% higher than SARB-UNet and 0.2% higher than MAP-Net. The visualization results show that FMAM-Net improves the fuzzy boundary of building segmentation and reduces the inter-class indistinction.https://ieeexplore.ieee.org/document/9996387/Remote sensing imagebuilding segmentationattention mechanismfeature refinementencoder-decoder
spellingShingle Huanran Ye
Run Zhou
Jianhao Wang
Zhiliang Huang
FMAM-Net: Fusion Multi-Scale Attention Mechanism Network for Building Segmentation in Remote Sensing Images
IEEE Access
Remote sensing image
building segmentation
attention mechanism
feature refinement
encoder-decoder
title FMAM-Net: Fusion Multi-Scale Attention Mechanism Network for Building Segmentation in Remote Sensing Images
title_full FMAM-Net: Fusion Multi-Scale Attention Mechanism Network for Building Segmentation in Remote Sensing Images
title_fullStr FMAM-Net: Fusion Multi-Scale Attention Mechanism Network for Building Segmentation in Remote Sensing Images
title_full_unstemmed FMAM-Net: Fusion Multi-Scale Attention Mechanism Network for Building Segmentation in Remote Sensing Images
title_short FMAM-Net: Fusion Multi-Scale Attention Mechanism Network for Building Segmentation in Remote Sensing Images
title_sort fmam net fusion multi scale attention mechanism network for building segmentation in remote sensing images
topic Remote sensing image
building segmentation
attention mechanism
feature refinement
encoder-decoder
url https://ieeexplore.ieee.org/document/9996387/
work_keys_str_mv AT huanranye fmamnetfusionmultiscaleattentionmechanismnetworkforbuildingsegmentationinremotesensingimages
AT runzhou fmamnetfusionmultiscaleattentionmechanismnetworkforbuildingsegmentationinremotesensingimages
AT jianhaowang fmamnetfusionmultiscaleattentionmechanismnetworkforbuildingsegmentationinremotesensingimages
AT zhilianghuang fmamnetfusionmultiscaleattentionmechanismnetworkforbuildingsegmentationinremotesensingimages