A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval

Geometric and spectral distortions of remote sensing images are key obstacles for deep learning-based supervised classification and retrieval, which are worsened by cross-dataset applications. A learnable geometric transformation model imbedded in a deep learning model has been used as a tool for ha...

Full description

Bibliographic Details
Main Authors:	Yameng Wang, Shunping Ji, Yongjun Zhang
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:	Convolutional neural network (CNN) remote sensing image retrieval (RSIR) spatial transformation spectral transformation
Online Access:	https://ieeexplore.ieee.org/document/9512433/

_version_	1818601252458594304
author	Yameng Wang Shunping Ji Yongjun Zhang
author_facet	Yameng Wang Shunping Ji Yongjun Zhang
author_sort	Yameng Wang
collection	DOAJ
description	Geometric and spectral distortions of remote sensing images are key obstacles for deep learning-based supervised classification and retrieval, which are worsened by cross-dataset applications. A learnable geometric transformation model imbedded in a deep learning model has been used as a tool for handling geometric distortions to process close-range images with different view angles. However, a learnable spectral transformation model, which is more noteworthy in remote image processing, has not yet been designed and explored up to now. In this paper, we propose a learnable joint spatial and spectral transformation (JSST) model for remote sensing image retrieval (RSIR), which is composed of three modules: a parameter generation network (PGN); a spatial conversion module; and a spectral conversion module. The PGN adaptively learns the geometric and spectral transformation parameters simultaneously from the different input image content, and these parameters then guide the spatial and spectral conversions to produce a new modified image with geometric and spectral correction. Our learnable JSST is imbedded in the front-end of the deep-learning-based retrieval network. The spatial and spectral-modified inputs provided by the JSST endow the retrieval network with better generalization and adaptation ability for cross-dataset RSIR. Our experiments on four open-source RSIR datasets confirmed that our proposed JSST embedded retrieval network outperformed state-of-the-art approaches comprehensively.
first_indexed	2024-12-16T12:48:26Z
format	Article
id	doaj.art-55457e9fd3764fbdba3351e08f7aebd2
institution	Directory Open Access Journal
issn	2151-1535
language	English
last_indexed	2024-12-16T12:48:26Z
publishDate	2021-01-01
publisher	IEEE
record_format	Article
series	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling	doaj.art-55457e9fd3764fbdba3351e08f7aebd22022-12-21T22:31:13ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01148100811210.1109/JSTARS.2021.31032169512433A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image RetrievalYameng Wang0Shunping Ji1https://orcid.org/0000-0002-3088-1481Yongjun Zhang2https://orcid.org/0000-0001-9845-4251School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaSchool of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaSchool of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaGeometric and spectral distortions of remote sensing images are key obstacles for deep learning-based supervised classification and retrieval, which are worsened by cross-dataset applications. A learnable geometric transformation model imbedded in a deep learning model has been used as a tool for handling geometric distortions to process close-range images with different view angles. However, a learnable spectral transformation model, which is more noteworthy in remote image processing, has not yet been designed and explored up to now. In this paper, we propose a learnable joint spatial and spectral transformation (JSST) model for remote sensing image retrieval (RSIR), which is composed of three modules: a parameter generation network (PGN); a spatial conversion module; and a spectral conversion module. The PGN adaptively learns the geometric and spectral transformation parameters simultaneously from the different input image content, and these parameters then guide the spatial and spectral conversions to produce a new modified image with geometric and spectral correction. Our learnable JSST is imbedded in the front-end of the deep-learning-based retrieval network. The spatial and spectral-modified inputs provided by the JSST endow the retrieval network with better generalization and adaptation ability for cross-dataset RSIR. Our experiments on four open-source RSIR datasets confirmed that our proposed JSST embedded retrieval network outperformed state-of-the-art approaches comprehensively.https://ieeexplore.ieee.org/document/9512433/Convolutional neural network (CNN)remote sensing image retrieval (RSIR)spatial transformationspectral transformation
spellingShingle	Yameng Wang Shunping Ji Yongjun Zhang A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Convolutional neural network (CNN) remote sensing image retrieval (RSIR) spatial transformation spectral transformation
title	A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
title_full	A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
title_fullStr	A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
title_full_unstemmed	A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
title_short	A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
title_sort	learnable joint spatial and spectral transformation for high resolution remote sensing image retrieval
topic	Convolutional neural network (CNN) remote sensing image retrieval (RSIR) spatial transformation spectral transformation
url	https://ieeexplore.ieee.org/document/9512433/
work_keys_str_mv	AT yamengwang alearnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval AT shunpingji alearnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval AT yongjunzhang alearnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval AT yamengwang learnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval AT shunpingji learnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval AT yongjunzhang learnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval

A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval

Similar Items