A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval
Geometric and spectral distortions of remote sensing images are key obstacles for deep learning-based supervised classification and retrieval, which are worsened by cross-dataset applications. A learnable geometric transformation model imbedded in a deep learning model has been used as a tool for ha...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9512433/ |
_version_ | 1818601252458594304 |
---|---|
author | Yameng Wang Shunping Ji Yongjun Zhang |
author_facet | Yameng Wang Shunping Ji Yongjun Zhang |
author_sort | Yameng Wang |
collection | DOAJ |
description | Geometric and spectral distortions of remote sensing images are key obstacles for deep learning-based supervised classification and retrieval, which are worsened by cross-dataset applications. A learnable geometric transformation model imbedded in a deep learning model has been used as a tool for handling geometric distortions to process close-range images with different view angles. However, a learnable spectral transformation model, which is more noteworthy in remote image processing, has not yet been designed and explored up to now. In this paper, we propose a learnable joint spatial and spectral transformation (JSST) model for remote sensing image retrieval (RSIR), which is composed of three modules: a parameter generation network (PGN); a spatial conversion module; and a spectral conversion module. The PGN adaptively learns the geometric and spectral transformation parameters simultaneously from the different input image content, and these parameters then guide the spatial and spectral conversions to produce a new modified image with geometric and spectral correction. Our learnable JSST is imbedded in the front-end of the deep-learning-based retrieval network. The spatial and spectral-modified inputs provided by the JSST endow the retrieval network with better generalization and adaptation ability for cross-dataset RSIR. Our experiments on four open-source RSIR datasets confirmed that our proposed JSST embedded retrieval network outperformed state-of-the-art approaches comprehensively. |
first_indexed | 2024-12-16T12:48:26Z |
format | Article |
id | doaj.art-55457e9fd3764fbdba3351e08f7aebd2 |
institution | Directory Open Access Journal |
issn | 2151-1535 |
language | English |
last_indexed | 2024-12-16T12:48:26Z |
publishDate | 2021-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
spelling | doaj.art-55457e9fd3764fbdba3351e08f7aebd22022-12-21T22:31:13ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01148100811210.1109/JSTARS.2021.31032169512433A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image RetrievalYameng Wang0Shunping Ji1https://orcid.org/0000-0002-3088-1481Yongjun Zhang2https://orcid.org/0000-0001-9845-4251School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaSchool of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaSchool of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaGeometric and spectral distortions of remote sensing images are key obstacles for deep learning-based supervised classification and retrieval, which are worsened by cross-dataset applications. A learnable geometric transformation model imbedded in a deep learning model has been used as a tool for handling geometric distortions to process close-range images with different view angles. However, a learnable spectral transformation model, which is more noteworthy in remote image processing, has not yet been designed and explored up to now. In this paper, we propose a learnable joint spatial and spectral transformation (JSST) model for remote sensing image retrieval (RSIR), which is composed of three modules: a parameter generation network (PGN); a spatial conversion module; and a spectral conversion module. The PGN adaptively learns the geometric and spectral transformation parameters simultaneously from the different input image content, and these parameters then guide the spatial and spectral conversions to produce a new modified image with geometric and spectral correction. Our learnable JSST is imbedded in the front-end of the deep-learning-based retrieval network. The spatial and spectral-modified inputs provided by the JSST endow the retrieval network with better generalization and adaptation ability for cross-dataset RSIR. Our experiments on four open-source RSIR datasets confirmed that our proposed JSST embedded retrieval network outperformed state-of-the-art approaches comprehensively.https://ieeexplore.ieee.org/document/9512433/Convolutional neural network (CNN)remote sensing image retrieval (RSIR)spatial transformationspectral transformation |
spellingShingle | Yameng Wang Shunping Ji Yongjun Zhang A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Convolutional neural network (CNN) remote sensing image retrieval (RSIR) spatial transformation spectral transformation |
title | A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval |
title_full | A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval |
title_fullStr | A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval |
title_full_unstemmed | A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval |
title_short | A Learnable Joint Spatial and Spectral Transformation for High Resolution Remote Sensing Image Retrieval |
title_sort | learnable joint spatial and spectral transformation for high resolution remote sensing image retrieval |
topic | Convolutional neural network (CNN) remote sensing image retrieval (RSIR) spatial transformation spectral transformation |
url | https://ieeexplore.ieee.org/document/9512433/ |
work_keys_str_mv | AT yamengwang alearnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval AT shunpingji alearnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval AT yongjunzhang alearnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval AT yamengwang learnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval AT shunpingji learnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval AT yongjunzhang learnablejointspatialandspectraltransformationforhighresolutionremotesensingimageretrieval |