Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks

Although deep learning-based approaches for video processing have been extensively investigated, the lack of generality in network construction makes it challenging for practical applications, particularly in video restoration. As a result, this paper presents a universal video restoration model tha...

Full description

Bibliographic Details
Main Authors:	Yuanfeng Zheng, Yuchen Yan, Hao Jiang
Format:	Article
Language:	English
Published:	MDPI AG 2023-09-01
Series:	Applied Sciences
Subjects:	video restoration video inpainting super-resolution non-local
Online Access:	https://www.mdpi.com/2076-3417/13/18/10001

_version_	1797581450846928896
author	Yuanfeng Zheng Yuchen Yan Hao Jiang
author_facet	Yuanfeng Zheng Yuchen Yan Hao Jiang
author_sort	Yuanfeng Zheng
collection	DOAJ
description	Although deep learning-based approaches for video processing have been extensively investigated, the lack of generality in network construction makes it challenging for practical applications, particularly in video restoration. As a result, this paper presents a universal video restoration model that can simultaneously tackle video inpainting and super-resolution tasks. The network, called Video-Restoration-Net (VRN), consists of four components: (1) an encoder to extract features from each frame, (2) a non-local network that recombines features from adjacent frames or different locations of a given frame, (3) a decoder to restore the coarse video from the output of a non-local block, and (4) a refinement network to refine the coarse video on the frame level. The framework is trained in a three-step pipeline to improve training stability for both tasks. Specifically, we first suggest an automated technique to generate full video datasets for super-resolution reconstruction and another complete-incomplete video dataset for inpainting, respectively. A VRN is then trained to inpaint the incomplete videos. Meanwhile, the full video datasets are adopted to train another VRN frame-wisely and validate it against authoritative datasets. We show quantitative comparisons with several baseline models, achieving 40.5042 dB/0.99473 on PSNR/SSIM in the inpainting task, while during the SR task we obtained 28.41 dB/0.7953 and 27.25/0.8152 on BSD100 and Urban100, respectively. The qualitative comparisons demonstrate that our proposed model is able to complete masked regions and implement super-resolution reconstruction in videos of high quality. Furthermore, the above results show that our method has greater versatility both in video inpainting and super-resolution tasks compared to recent models.
first_indexed	2024-03-10T23:05:38Z
format	Article
id	doaj.art-948da5024aa941258535d9cc860c760d
institution	Directory Open Access Journal
issn	2076-3417
language	English
last_indexed	2024-03-10T23:05:38Z
publishDate	2023-09-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj.art-948da5024aa941258535d9cc860c760d2023-11-19T09:21:38ZengMDPI AGApplied Sciences2076-34172023-09-0113181000110.3390/app131810001Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution TasksYuanfeng Zheng0Yuchen Yan1Hao Jiang2School of Electronic Information, Wuhan University, Wuhan 430072, ChinaSchool of Electronic Information, Wuhan University, Wuhan 430072, ChinaSchool of Electronic Information, Wuhan University, Wuhan 430072, ChinaAlthough deep learning-based approaches for video processing have been extensively investigated, the lack of generality in network construction makes it challenging for practical applications, particularly in video restoration. As a result, this paper presents a universal video restoration model that can simultaneously tackle video inpainting and super-resolution tasks. The network, called Video-Restoration-Net (VRN), consists of four components: (1) an encoder to extract features from each frame, (2) a non-local network that recombines features from adjacent frames or different locations of a given frame, (3) a decoder to restore the coarse video from the output of a non-local block, and (4) a refinement network to refine the coarse video on the frame level. The framework is trained in a three-step pipeline to improve training stability for both tasks. Specifically, we first suggest an automated technique to generate full video datasets for super-resolution reconstruction and another complete-incomplete video dataset for inpainting, respectively. A VRN is then trained to inpaint the incomplete videos. Meanwhile, the full video datasets are adopted to train another VRN frame-wisely and validate it against authoritative datasets. We show quantitative comparisons with several baseline models, achieving 40.5042 dB/0.99473 on PSNR/SSIM in the inpainting task, while during the SR task we obtained 28.41 dB/0.7953 and 27.25/0.8152 on BSD100 and Urban100, respectively. The qualitative comparisons demonstrate that our proposed model is able to complete masked regions and implement super-resolution reconstruction in videos of high quality. Furthermore, the above results show that our method has greater versatility both in video inpainting and super-resolution tasks compared to recent models.https://www.mdpi.com/2076-3417/13/18/10001video restorationvideo inpaintingsuper-resolutionnon-local
spellingShingle	Yuanfeng Zheng Yuchen Yan Hao Jiang Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks Applied Sciences video restoration video inpainting super-resolution non-local
title	Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
title_full	Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
title_fullStr	Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
title_full_unstemmed	Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
title_short	Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
title_sort	video restoration net deep generative model with non local network for inpainting and super resolution tasks
topic	video restoration video inpainting super-resolution non-local
url	https://www.mdpi.com/2076-3417/13/18/10001
work_keys_str_mv	AT yuanfengzheng videorestorationnetdeepgenerativemodelwithnonlocalnetworkforinpaintingandsuperresolutiontasks AT yuchenyan videorestorationnetdeepgenerativemodelwithnonlocalnetworkforinpaintingandsuperresolutiontasks AT haojiang videorestorationnetdeepgenerativemodelwithnonlocalnetworkforinpaintingandsuperresolutiontasks

Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks

Similar Items