IRE: Improved Image Super-Resolution Based on Real-ESRGAN

Image super-resolution (SR) is a research field focusing on image degradation techniques. The High-order Deterioration Model (HDM) implemented in Real-ESRGAN has proven more effective in simulating the degradation of real-world images compared to conventional bicubic kernel interpolation. However, i...

Full description

Bibliographic Details
Main Authors: Zhengwei Zhu, Yushi Lei, Yilin Qin, Chenyang Zhu, Yanping Zhu
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10066280/
_version_ 1797828274998476800
author Zhengwei Zhu
Yushi Lei
Yilin Qin
Chenyang Zhu
Yanping Zhu
author_facet Zhengwei Zhu
Yushi Lei
Yilin Qin
Chenyang Zhu
Yanping Zhu
author_sort Zhengwei Zhu
collection DOAJ
description Image super-resolution (SR) is a research field focusing on image degradation techniques. The High-order Deterioration Model (HDM) implemented in Real-ESRGAN has proven more effective in simulating the degradation of real-world images compared to conventional bicubic kernel interpolation. However, images reconstructed by Real-ESRGAN suffer from two significant weaknesses. Firstly, the rebuilt image is overly smooth and suffers from substantial texture information loss, resulting in a worse performance than classical models such as SRGAN and ESRGAN. Secondly, the reconstructed images exhibit better visualization effects but are entirely different from the original image, violating the principle of image reconstruction. To address these issues, this paper presents an improved image SR model based on the HDM implemented in Real-ESRGAN. The first-order degradation modeling of HDM was removed, and only the second-order degradation modeling was kept to reduce the degree of visual deterioration. PatchGAN was used as the fundamental structure of the discriminator, and a channel attention mechanism was added to the generator’s dense block to enhance texture details in the reconstructed images. The L1 loss function was also replaced with the SmoothL1 loss function to improve convergence speed and model performance. The proposed model, IRE, was evaluated on various benchmark datasets and compared to Real-ESRGAN. The results show that the proposed model outperforms Real-ESRGAN regarding visual quality and measures such as RankIQA and NIQE. The study also indicates that PatchGAN, as the discriminator, reduces the average training time by approximately 28%.
first_indexed 2024-04-09T13:01:36Z
format Article
id doaj.art-5edee8b2b9b1416a9cb1841381f6a5cd
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-09T13:01:36Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-5edee8b2b9b1416a9cb1841381f6a5cd2023-05-12T23:00:19ZengIEEEIEEE Access2169-35362023-01-0111453344534810.1109/ACCESS.2023.325608610066280IRE: Improved Image Super-Resolution Based on Real-ESRGANZhengwei Zhu0https://orcid.org/0000-0003-1187-1912Yushi Lei1Yilin Qin2Chenyang Zhu3https://orcid.org/0000-0002-2145-0559Yanping Zhu4https://orcid.org/0000-0002-8107-0101School of Microelectronics and Control Engineering, Changzhou University, Changzhou, ChinaSchool of Microelectronics and Control Engineering, Changzhou University, Changzhou, ChinaSchool of Computer Science, Changzhou Technical Institute of Tourism and Commerce, Changzhou, ChinaSchool of Computer Science and Artificial Intelligence, Changzhou University, Changzhou, ChinaSchool of Microelectronics and Control Engineering, Changzhou University, Changzhou, ChinaImage super-resolution (SR) is a research field focusing on image degradation techniques. The High-order Deterioration Model (HDM) implemented in Real-ESRGAN has proven more effective in simulating the degradation of real-world images compared to conventional bicubic kernel interpolation. However, images reconstructed by Real-ESRGAN suffer from two significant weaknesses. Firstly, the rebuilt image is overly smooth and suffers from substantial texture information loss, resulting in a worse performance than classical models such as SRGAN and ESRGAN. Secondly, the reconstructed images exhibit better visualization effects but are entirely different from the original image, violating the principle of image reconstruction. To address these issues, this paper presents an improved image SR model based on the HDM implemented in Real-ESRGAN. The first-order degradation modeling of HDM was removed, and only the second-order degradation modeling was kept to reduce the degree of visual deterioration. PatchGAN was used as the fundamental structure of the discriminator, and a channel attention mechanism was added to the generator’s dense block to enhance texture details in the reconstructed images. The L1 loss function was also replaced with the SmoothL1 loss function to improve convergence speed and model performance. The proposed model, IRE, was evaluated on various benchmark datasets and compared to Real-ESRGAN. The results show that the proposed model outperforms Real-ESRGAN regarding visual quality and measures such as RankIQA and NIQE. The study also indicates that PatchGAN, as the discriminator, reduces the average training time by approximately 28%.https://ieeexplore.ieee.org/document/10066280/Super resolutionreal-ESRGANSmoothL1channel attentionPatchGAN
spellingShingle Zhengwei Zhu
Yushi Lei
Yilin Qin
Chenyang Zhu
Yanping Zhu
IRE: Improved Image Super-Resolution Based on Real-ESRGAN
IEEE Access
Super resolution
real-ESRGAN
SmoothL1
channel attention
PatchGAN
title IRE: Improved Image Super-Resolution Based on Real-ESRGAN
title_full IRE: Improved Image Super-Resolution Based on Real-ESRGAN
title_fullStr IRE: Improved Image Super-Resolution Based on Real-ESRGAN
title_full_unstemmed IRE: Improved Image Super-Resolution Based on Real-ESRGAN
title_short IRE: Improved Image Super-Resolution Based on Real-ESRGAN
title_sort ire improved image super resolution based on real esrgan
topic Super resolution
real-ESRGAN
SmoothL1
channel attention
PatchGAN
url https://ieeexplore.ieee.org/document/10066280/
work_keys_str_mv AT zhengweizhu ireimprovedimagesuperresolutionbasedonrealesrgan
AT yushilei ireimprovedimagesuperresolutionbasedonrealesrgan
AT yilinqin ireimprovedimagesuperresolutionbasedonrealesrgan
AT chenyangzhu ireimprovedimagesuperresolutionbasedonrealesrgan
AT yanpingzhu ireimprovedimagesuperresolutionbasedonrealesrgan