A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval

Geometric and spectral distortions of remote sensing images are key obstacles for deep learning-based supervised classification and retrieval, which are worsened by cross-dataset applications. A learnable geometric transformation model imbedded in a deep learning model has been used as a tool for ha...

Full description

Bibliographic Details
Main Authors: Yameng Wang, Shunping Ji, Yongjun Zhang
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9512433/
_version_ 1818601252458594304
author Yameng Wang
Shunping Ji
Yongjun Zhang
author_facet Yameng Wang
Shunping Ji
Yongjun Zhang
author_sort Yameng Wang
collection DOAJ
description Geometric and spectral distortions of remote sensing images are key obstacles for deep learning-based supervised classification and retrieval, which are worsened by cross-dataset applications. A learnable geometric transformation model imbedded in a deep learning model has been used as a tool for handling geometric distortions to process close-range images with different view angles. However, a learnable spectral transformation model, which is more noteworthy in remote image processing, has not yet been designed and explored up to now. In this paper, we propose a learnable joint spatial and spectral transformation (JSST) model for remote sensing image retrieval (RSIR), which is composed of three modules: a parameter generation network (PGN); a spatial conversion module; and a spectral conversion module. The PGN adaptively learns the geometric and spectral transformation parameters simultaneously from the different input image content, and these parameters then guide the spatial and spectral conversions to produce a new modified image with geometric and spectral correction. Our learnable JSST is imbedded in the front-end of the deep-learning-based retrieval network. The spatial and spectral-modified inputs provided by the JSST endow the retrieval network with better generalization and adaptation ability for cross-dataset RSIR. Our experiments on four open-source RSIR datasets confirmed that our proposed JSST embedded retrieval network outperformed state-of-the-art approaches comprehensively.
first_indexed 2024-12-16T12:48:26Z
format Article
id doaj.art-55457e9fd3764fbdba3351e08f7aebd2
institution Directory Open Access Journal
issn 2151-1535
language English
last_indexed 2024-12-16T12:48:26Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-55457e9fd3764fbdba3351e08f7aebd22022-12-21T22:31:13ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01148100811210.1109/JSTARS.2021.31032169512433A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image RetrievalYameng Wang0Shunping Ji1https://orcid.org/0000-0002-3088-1481Yongjun Zhang2https://orcid.org/0000-0001-9845-4251School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaSchool of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaSchool of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaGeometric and spectral distortions of remote sensing images are key obstacles for deep learning-based supervised classification and retrieval, which are worsened by cross-dataset applications. A learnable geometric transformation model imbedded in a deep learning model has been used as a tool for handling geometric distortions to process close-range images with different view angles. However, a learnable spectral transformation model, which is more noteworthy in remote image processing, has not yet been designed and explored up to now. In this paper, we propose a learnable joint spatial and spectral transformation (JSST) model for remote sensing image retrieval (RSIR), which is composed of three modules: a parameter generation network (PGN); a spatial conversion module; and a spectral conversion module. The PGN adaptively learns the geometric and spectral transformation parameters simultaneously from the different input image content, and these parameters then guide the spatial and spectral conversions to produce a new modified image with geometric and spectral correction. Our learnable JSST is imbedded in the front-end of the deep-learning-based retrieval network. The spatial and spectral-modified inputs provided by the JSST endow the retrieval network with better generalization and adaptation ability for cross-dataset RSIR. Our experiments on four open-source RSIR datasets confirmed that our proposed JSST embedded retrieval network outperformed state-of-the-art approaches comprehensively.https://ieeexplore.ieee.org/document/9512433/Convolutional neural network (CNN)remote sensing image retrieval (RSIR)spatial transformationspectral transformation
spellingShingle Yameng Wang
Shunping Ji
Yongjun Zhang
A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Convolutional neural network (CNN)
remote sensing image retrieval (RSIR)
spatial transformation
spectral transformation
title A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
title_full A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
title_fullStr A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
title_full_unstemmed A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
title_short A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
title_sort learnable joint spatial and spectral transformation for high resolution remote sensing image retrieval
topic Convolutional neural network (CNN)
remote sensing image retrieval (RSIR)
spatial transformation
spectral transformation
url https://ieeexplore.ieee.org/document/9512433/
work_keys_str_mv AT yamengwang alearnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval
AT shunpingji alearnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval
AT yongjunzhang alearnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval
AT yamengwang learnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval
AT shunpingji learnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval
AT yongjunzhang learnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval