IRE: Improved Image Super-Resolution Based on Real-ESRGAN
Image super-resolution (SR) is a research field focusing on image degradation techniques. The High-order Deterioration Model (HDM) implemented in Real-ESRGAN has proven more effective in simulating the degradation of real-world images compared to conventional bicubic kernel interpolation. However, i...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2023-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10066280/ |
_version_ | 1797828274998476800 |
---|---|
author | Zhengwei Zhu Yushi Lei Yilin Qin Chenyang Zhu Yanping Zhu |
author_facet | Zhengwei Zhu Yushi Lei Yilin Qin Chenyang Zhu Yanping Zhu |
author_sort | Zhengwei Zhu |
collection | DOAJ |
description | Image super-resolution (SR) is a research field focusing on image degradation techniques. The High-order Deterioration Model (HDM) implemented in Real-ESRGAN has proven more effective in simulating the degradation of real-world images compared to conventional bicubic kernel interpolation. However, images reconstructed by Real-ESRGAN suffer from two significant weaknesses. Firstly, the rebuilt image is overly smooth and suffers from substantial texture information loss, resulting in a worse performance than classical models such as SRGAN and ESRGAN. Secondly, the reconstructed images exhibit better visualization effects but are entirely different from the original image, violating the principle of image reconstruction. To address these issues, this paper presents an improved image SR model based on the HDM implemented in Real-ESRGAN. The first-order degradation modeling of HDM was removed, and only the second-order degradation modeling was kept to reduce the degree of visual deterioration. PatchGAN was used as the fundamental structure of the discriminator, and a channel attention mechanism was added to the generator’s dense block to enhance texture details in the reconstructed images. The L1 loss function was also replaced with the SmoothL1 loss function to improve convergence speed and model performance. The proposed model, IRE, was evaluated on various benchmark datasets and compared to Real-ESRGAN. The results show that the proposed model outperforms Real-ESRGAN regarding visual quality and measures such as RankIQA and NIQE. The study also indicates that PatchGAN, as the discriminator, reduces the average training time by approximately 28%. |
first_indexed | 2024-04-09T13:01:36Z |
format | Article |
id | doaj.art-5edee8b2b9b1416a9cb1841381f6a5cd |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-04-09T13:01:36Z |
publishDate | 2023-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-5edee8b2b9b1416a9cb1841381f6a5cd2023-05-12T23:00:19ZengIEEEIEEE Access2169-35362023-01-0111453344534810.1109/ACCESS.2023.325608610066280IRE: Improved Image Super-Resolution Based on Real-ESRGANZhengwei Zhu0https://orcid.org/0000-0003-1187-1912Yushi Lei1Yilin Qin2Chenyang Zhu3https://orcid.org/0000-0002-2145-0559Yanping Zhu4https://orcid.org/0000-0002-8107-0101School of Microelectronics and Control Engineering, Changzhou University, Changzhou, ChinaSchool of Microelectronics and Control Engineering, Changzhou University, Changzhou, ChinaSchool of Computer Science, Changzhou Technical Institute of Tourism and Commerce, Changzhou, ChinaSchool of Computer Science and Artificial Intelligence, Changzhou University, Changzhou, ChinaSchool of Microelectronics and Control Engineering, Changzhou University, Changzhou, ChinaImage super-resolution (SR) is a research field focusing on image degradation techniques. The High-order Deterioration Model (HDM) implemented in Real-ESRGAN has proven more effective in simulating the degradation of real-world images compared to conventional bicubic kernel interpolation. However, images reconstructed by Real-ESRGAN suffer from two significant weaknesses. Firstly, the rebuilt image is overly smooth and suffers from substantial texture information loss, resulting in a worse performance than classical models such as SRGAN and ESRGAN. Secondly, the reconstructed images exhibit better visualization effects but are entirely different from the original image, violating the principle of image reconstruction. To address these issues, this paper presents an improved image SR model based on the HDM implemented in Real-ESRGAN. The first-order degradation modeling of HDM was removed, and only the second-order degradation modeling was kept to reduce the degree of visual deterioration. PatchGAN was used as the fundamental structure of the discriminator, and a channel attention mechanism was added to the generator’s dense block to enhance texture details in the reconstructed images. The L1 loss function was also replaced with the SmoothL1 loss function to improve convergence speed and model performance. The proposed model, IRE, was evaluated on various benchmark datasets and compared to Real-ESRGAN. The results show that the proposed model outperforms Real-ESRGAN regarding visual quality and measures such as RankIQA and NIQE. The study also indicates that PatchGAN, as the discriminator, reduces the average training time by approximately 28%.https://ieeexplore.ieee.org/document/10066280/Super resolutionreal-ESRGANSmoothL1channel attentionPatchGAN |
spellingShingle | Zhengwei Zhu Yushi Lei Yilin Qin Chenyang Zhu Yanping Zhu IRE: Improved Image Super-Resolution Based on Real-ESRGAN IEEE Access Super resolution real-ESRGAN SmoothL1 channel attention PatchGAN |
title | IRE: Improved Image Super-Resolution Based on Real-ESRGAN |
title_full | IRE: Improved Image Super-Resolution Based on Real-ESRGAN |
title_fullStr | IRE: Improved Image Super-Resolution Based on Real-ESRGAN |
title_full_unstemmed | IRE: Improved Image Super-Resolution Based on Real-ESRGAN |
title_short | IRE: Improved Image Super-Resolution Based on Real-ESRGAN |
title_sort | ire improved image super resolution based on real esrgan |
topic | Super resolution real-ESRGAN SmoothL1 channel attention PatchGAN |
url | https://ieeexplore.ieee.org/document/10066280/ |
work_keys_str_mv | AT zhengweizhu ireimprovedimagesuperresolutionbasedonrealesrgan AT yushilei ireimprovedimagesuperresolutionbasedonrealesrgan AT yilinqin ireimprovedimagesuperresolutionbasedonrealesrgan AT chenyangzhu ireimprovedimagesuperresolutionbasedonrealesrgan AT yanpingzhu ireimprovedimagesuperresolutionbasedonrealesrgan |