MSFusion: Multistage for Remote Sensing Image Spatiotemporal Fusion Based on Texture Transformer and Convolutional Neural Network

Due to the limitations of current technology and budget, a single satellite sensor cannot obtain high spatiotemporal resolution remote sensing images. Therefore, remote sensing image spatio-temporal fusion technology is considered as an effective solution and has attracted extensive attention. In th...

Full description

Bibliographic Details
Main Authors: Guangqi Yang, Yurong Qian, Hui Liu, Bochuan Tang, Ranran Qi, Yi Lu, Jun Geng
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9786792/
_version_ 1811332169846489088
author Guangqi Yang
Yurong Qian
Hui Liu
Bochuan Tang
Ranran Qi
Yi Lu
Jun Geng
author_facet Guangqi Yang
Yurong Qian
Hui Liu
Bochuan Tang
Ranran Qi
Yi Lu
Jun Geng
author_sort Guangqi Yang
collection DOAJ
description Due to the limitations of current technology and budget, a single satellite sensor cannot obtain high spatiotemporal resolution remote sensing images. Therefore, remote sensing image spatio-temporal fusion technology is considered as an effective solution and has attracted extensive attention. In the field of deep learning, due to the fixed size of the perception field of a convolutional neural network, it is impossible to model the correlation of global features, and the features extracted only through convolution operation lack the ability to capture long-distance features. At the same time, complex fusion methods cannot better integrate temporal and spatial features. In order to solve these problems, we propose a multistage remote sensing image spatio-temporal fusion model based on texture transformer and convolutional neural network. The model combines the advantages of transformer and convolutional network, uses a lightweight convolution network to extract spatial features and temporal discrepancy features, uses transformer to learn global temporal correlation, and finally, fuses temporal features with spatial features. In order to make full use of the features obtained in different stages, we design a cross-stage adaptive fusion module CSAFM. The module adopts the self-attention mechanism to adaptively integrate the features of different scales while considering the temporal and spatial characteristics. To test the robustness of the model, the experiments are carried out on three datasets of CIA, LGC, and DX. Compared with five typical spatio-temporal fusion algorithms, we obtain excellent results, which prove the superiority of MSFusion model.
first_indexed 2024-04-13T16:32:45Z
format Article
id doaj.art-b1d1e7abf9f74dd8a283c3c95d92c399
institution Directory Open Access Journal
issn 2151-1535
language English
last_indexed 2024-04-13T16:32:45Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-b1d1e7abf9f74dd8a283c3c95d92c3992022-12-22T02:39:32ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352022-01-01154653466610.1109/JSTARS.2022.31794159786792MSFusion: Multistage for Remote Sensing Image Spatiotemporal Fusion Based on Texture Transformer and Convolutional Neural NetworkGuangqi Yang0https://orcid.org/0000-0001-5588-864XYurong Qian1Hui Liu2https://orcid.org/0000-0002-7868-8314Bochuan Tang3Ranran Qi4Yi Lu5Jun Geng6School of Software, Key Laboratory of signal detection and processing in Xinjiang Uygur Autonomous Region, Xinjiang University, Urumqi, ChinaSchool of Software, Key Laboratory of signal detection and processing in Xinjiang Uygur Autonomous Region, Xinjiang University, Urumqi, ChinaKey Laboratory of Software Engineering, Urumqi, ChinaSchool of Software, Key Laboratory of signal detection and processing in Xinjiang Uygur Autonomous Region, Xinjiang University, Urumqi, ChinaSchool of Software, Key Laboratory of signal detection and processing in Xinjiang Uygur Autonomous Region, Xinjiang University, Urumqi, ChinaSchool of Software, Key Laboratory of signal detection and processing in Xinjiang Uygur Autonomous Region, Xinjiang University, Urumqi, ChinaSchool of Software, Xinjiang University, Urumqi, ChinaDue to the limitations of current technology and budget, a single satellite sensor cannot obtain high spatiotemporal resolution remote sensing images. Therefore, remote sensing image spatio-temporal fusion technology is considered as an effective solution and has attracted extensive attention. In the field of deep learning, due to the fixed size of the perception field of a convolutional neural network, it is impossible to model the correlation of global features, and the features extracted only through convolution operation lack the ability to capture long-distance features. At the same time, complex fusion methods cannot better integrate temporal and spatial features. In order to solve these problems, we propose a multistage remote sensing image spatio-temporal fusion model based on texture transformer and convolutional neural network. The model combines the advantages of transformer and convolutional network, uses a lightweight convolution network to extract spatial features and temporal discrepancy features, uses transformer to learn global temporal correlation, and finally, fuses temporal features with spatial features. In order to make full use of the features obtained in different stages, we design a cross-stage adaptive fusion module CSAFM. The module adopts the self-attention mechanism to adaptively integrate the features of different scales while considering the temporal and spatial characteristics. To test the robustness of the model, the experiments are carried out on three datasets of CIA, LGC, and DX. Compared with five typical spatio-temporal fusion algorithms, we obtain excellent results, which prove the superiority of MSFusion model.https://ieeexplore.ieee.org/document/9786792/Multistage feature fusionmultitemporal remote sensing dataremote sensingself-attentionspatiotemporal fusiontransformer
spellingShingle Guangqi Yang
Yurong Qian
Hui Liu
Bochuan Tang
Ranran Qi
Yi Lu
Jun Geng
MSFusion: Multistage for Remote Sensing Image Spatiotemporal Fusion Based on Texture Transformer and Convolutional Neural Network
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Multistage feature fusion
multitemporal remote sensing data
remote sensing
self-attention
spatiotemporal fusion
transformer
title MSFusion: Multistage for Remote Sensing Image Spatiotemporal Fusion Based on Texture Transformer and Convolutional Neural Network
title_full MSFusion: Multistage for Remote Sensing Image Spatiotemporal Fusion Based on Texture Transformer and Convolutional Neural Network
title_fullStr MSFusion: Multistage for Remote Sensing Image Spatiotemporal Fusion Based on Texture Transformer and Convolutional Neural Network
title_full_unstemmed MSFusion: Multistage for Remote Sensing Image Spatiotemporal Fusion Based on Texture Transformer and Convolutional Neural Network
title_short MSFusion: Multistage for Remote Sensing Image Spatiotemporal Fusion Based on Texture Transformer and Convolutional Neural Network
title_sort msfusion multistage for remote sensing image spatiotemporal fusion based on texture transformer and convolutional neural network
topic Multistage feature fusion
multitemporal remote sensing data
remote sensing
self-attention
spatiotemporal fusion
transformer
url https://ieeexplore.ieee.org/document/9786792/
work_keys_str_mv AT guangqiyang msfusionmultistageforremotesensingimagespatiotemporalfusionbasedontexturetransformerandconvolutionalneuralnetwork
AT yurongqian msfusionmultistageforremotesensingimagespatiotemporalfusionbasedontexturetransformerandconvolutionalneuralnetwork
AT huiliu msfusionmultistageforremotesensingimagespatiotemporalfusionbasedontexturetransformerandconvolutionalneuralnetwork
AT bochuantang msfusionmultistageforremotesensingimagespatiotemporalfusionbasedontexturetransformerandconvolutionalneuralnetwork
AT ranranqi msfusionmultistageforremotesensingimagespatiotemporalfusionbasedontexturetransformerandconvolutionalneuralnetwork
AT yilu msfusionmultistageforremotesensingimagespatiotemporalfusionbasedontexturetransformerandconvolutionalneuralnetwork
AT jungeng msfusionmultistageforremotesensingimagespatiotemporalfusionbasedontexturetransformerandconvolutionalneuralnetwork