Global Priors With Anchored-Stripe Attention and Multiscale Convolution for Remote Sensing Image Compression

Compressing remote sensing images with high spatial and spectral resolution plays an important role in subsequent image processing and information acquisition. Accurate data modeling can help the entropy model to better estimate the entropy value. For better image recovery, it is necessary to make f...

Full description

Bibliographic Details
Main Authors: Lei Zhang, Xugang Hu, Tianpeng Pan, Lili Zhang
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10292922/
Description
Summary:Compressing remote sensing images with high spatial and spectral resolution plays an important role in subsequent image processing and information acquisition. Accurate data modeling can help the entropy model to better estimate the entropy value. For better image recovery, it is necessary to make full use of the prior information contained in the latent information. To achieve global association and hierarchical modeling of latent elements, this article proposes adding additional global anchored-stripe self-attention capturing global, local, and interchannel dependencies. To enhance the feature extraction capabilities of the encoder and the decoder, the multiscale attention module of depthwise convolution is used to increase the receptive field and nonlinear conversion process, ensuring that the network can retain more useful information. We evaluate the compression performance of the proposed method in terms of rate–distortion curves and running speed. Through comparative experiments on DOTA, LoveDA, and UC-Merced datasets, it is shown that the proposed method has a faster running speed than that of the context model. It outperforms some traditional compression methods, such as BPG, WebP, JPEG2000, and state-of-the-art deep-learning-based methods, in terms of peak signal-to-noise ratio and multiscale structural similarity index measure. In terms of perceptual quality, adding perceptual loss reduces the smooth image blurring due to MSE loss, and the proposed method has better image perceptual quality under the approximate bits per pixel.
ISSN:2151-1535