Summary: | Recently, with the rise and progress of convolutional neural networks (CNNs), CNN-based remote-sensing image super-resolution (RSSR) methods have gained considerable advancement and showed great power for image reconstruction tasks. However, most of these methods cannot handle well the enormous number of objects with different scales contained in remote-sensing images and thus limits super-resolution performance. To address these issues, we propose a multiscale fast Fourier transform (FFT) based attention network (MSFFTAN), which employs a multiinput U-shape structure as backbone for accurate RSSR. Specifically, we carefully design an FFT-based residual block consisting of an image domain branch and a Fourier domain branch to extract local details and global structures simultaneously. In addition, a local–global channel attention block is developed to further enhance the reconstruction ability of small targets. Finally, we present a branch gated selective block to adaptively explore and aggregate features from multiple scales and depths. Extensive experiments on two public datasets have demonstrated the superiority of MSFFTAN over the state-of-the-art (SOAT) approaches in aspects of both quantitative metrics and visual quality. The peak signal-to-noise ratio of our network is 1.5 dB higher than the SOAT method on the UCMerced LandUse with downscaling factor 2.
|