Water index and Swin Transformer Ensemble (WISTE) for water body extraction from multispectral remote sensing images

Automatic surface water body mapping using remote sensing technology is greatly meaningful for studying inland water dynamics at regional to global scales. Convolutional neural networks (CNN) have become an efficient semantic segmentation technique for the interpretation of remote sensing images. Ho...

Full description

Bibliographic Details
Main Authors: Donghui Ma, Liguang Jiang, Jie Li, Yun Shi
Format: Article
Language:English
Published: Taylor & Francis Group 2023-12-01
Series:GIScience & Remote Sensing
Subjects:
Online Access:http://dx.doi.org/10.1080/15481603.2023.2251704
_version_ 1797679027333365760
author Donghui Ma
Liguang Jiang
Jie Li
Yun Shi
author_facet Donghui Ma
Liguang Jiang
Jie Li
Yun Shi
author_sort Donghui Ma
collection DOAJ
description Automatic surface water body mapping using remote sensing technology is greatly meaningful for studying inland water dynamics at regional to global scales. Convolutional neural networks (CNN) have become an efficient semantic segmentation technique for the interpretation of remote sensing images. However, the receptive field value of a CNN is restricted by the convolutional kernel size because the network only focuses on local features. The Swin Transformer has recently demonstrated its outstanding performance in computer vision tasks, and it could be useful for processing multispectral remote sensing images. In this article, a Water Index and Swin Transformer Ensemble (WISTE) method for automatic water body extraction is proposed. First, a dual-branch encoder architecture is designed for the Swin Transformer, aggregating the global semantic information and pixel neighbor relationships captured by fully convolutional networks (FCN) and multihead self-attention. Second, to prevent the Swin Transformer from ignoring multispectral information, we construct a prediction map ensemble module. The predictions of the Swin Transformer and the Normalized Difference Water Index (NDWI) are combined by a Bayesian averaging strategy. Finally, the experimental results obtained on two distinct datasets demonstrate that the WISTE has advantages over other segmentation methods and achieves the best results. The method proposed in this study can be used for improving regional to continental surface water mapping and related hydrological studies.
first_indexed 2024-03-11T23:08:26Z
format Article
id doaj.art-6f8741a8805344288a1a8027a68a20a6
institution Directory Open Access Journal
issn 1548-1603
1943-7226
language English
last_indexed 2024-03-11T23:08:26Z
publishDate 2023-12-01
publisher Taylor & Francis Group
record_format Article
series GIScience & Remote Sensing
spelling doaj.art-6f8741a8805344288a1a8027a68a20a62023-09-21T12:43:10ZengTaylor & Francis GroupGIScience & Remote Sensing1548-16031943-72262023-12-0160110.1080/15481603.2023.22517042251704Water index and Swin Transformer Ensemble (WISTE) for water body extraction from multispectral remote sensing imagesDonghui Ma0Liguang Jiang1Jie Li2Yun Shi3Geovis Spatial Technology Co.LtdSchool of Environmental Science and Engineering, Southern University of Science and TechnologyXi’an Surveying and Mapping InstituteXi’an University of Science and TechnologyAutomatic surface water body mapping using remote sensing technology is greatly meaningful for studying inland water dynamics at regional to global scales. Convolutional neural networks (CNN) have become an efficient semantic segmentation technique for the interpretation of remote sensing images. However, the receptive field value of a CNN is restricted by the convolutional kernel size because the network only focuses on local features. The Swin Transformer has recently demonstrated its outstanding performance in computer vision tasks, and it could be useful for processing multispectral remote sensing images. In this article, a Water Index and Swin Transformer Ensemble (WISTE) method for automatic water body extraction is proposed. First, a dual-branch encoder architecture is designed for the Swin Transformer, aggregating the global semantic information and pixel neighbor relationships captured by fully convolutional networks (FCN) and multihead self-attention. Second, to prevent the Swin Transformer from ignoring multispectral information, we construct a prediction map ensemble module. The predictions of the Swin Transformer and the Normalized Difference Water Index (NDWI) are combined by a Bayesian averaging strategy. Finally, the experimental results obtained on two distinct datasets demonstrate that the WISTE has advantages over other segmentation methods and achieves the best results. The method proposed in this study can be used for improving regional to continental surface water mapping and related hydrological studies.http://dx.doi.org/10.1080/15481603.2023.2251704deep learningwater body extractionswin transformerwater indexdual-encoder
spellingShingle Donghui Ma
Liguang Jiang
Jie Li
Yun Shi
Water index and Swin Transformer Ensemble (WISTE) for water body extraction from multispectral remote sensing images
GIScience & Remote Sensing
deep learning
water body extraction
swin transformer
water index
dual-encoder
title Water index and Swin Transformer Ensemble (WISTE) for water body extraction from multispectral remote sensing images
title_full Water index and Swin Transformer Ensemble (WISTE) for water body extraction from multispectral remote sensing images
title_fullStr Water index and Swin Transformer Ensemble (WISTE) for water body extraction from multispectral remote sensing images
title_full_unstemmed Water index and Swin Transformer Ensemble (WISTE) for water body extraction from multispectral remote sensing images
title_short Water index and Swin Transformer Ensemble (WISTE) for water body extraction from multispectral remote sensing images
title_sort water index and swin transformer ensemble wiste for water body extraction from multispectral remote sensing images
topic deep learning
water body extraction
swin transformer
water index
dual-encoder
url http://dx.doi.org/10.1080/15481603.2023.2251704
work_keys_str_mv AT donghuima waterindexandswintransformerensemblewisteforwaterbodyextractionfrommultispectralremotesensingimages
AT liguangjiang waterindexandswintransformerensemblewisteforwaterbodyextractionfrommultispectralremotesensingimages
AT jieli waterindexandswintransformerensemblewisteforwaterbodyextractionfrommultispectralremotesensingimages
AT yunshi waterindexandswintransformerensemblewisteforwaterbodyextractionfrommultispectralremotesensingimages