Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks

Although deep learning-based approaches for video processing have been extensively investigated, the lack of generality in network construction makes it challenging for practical applications, particularly in video restoration. As a result, this paper presents a universal video restoration model tha...

Full description

Bibliographic Details
Main Authors: Yuanfeng Zheng, Yuchen Yan, Hao Jiang
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/18/10001
_version_ 1797581450846928896
author Yuanfeng Zheng
Yuchen Yan
Hao Jiang
author_facet Yuanfeng Zheng
Yuchen Yan
Hao Jiang
author_sort Yuanfeng Zheng
collection DOAJ
description Although deep learning-based approaches for video processing have been extensively investigated, the lack of generality in network construction makes it challenging for practical applications, particularly in video restoration. As a result, this paper presents a universal video restoration model that can simultaneously tackle video inpainting and super-resolution tasks. The network, called Video-Restoration-Net (VRN), consists of four components: (1) an encoder to extract features from each frame, (2) a non-local network that recombines features from adjacent frames or different locations of a given frame, (3) a decoder to restore the coarse video from the output of a non-local block, and (4) a refinement network to refine the coarse video on the frame level. The framework is trained in a three-step pipeline to improve training stability for both tasks. Specifically, we first suggest an automated technique to generate full video datasets for super-resolution reconstruction and another complete-incomplete video dataset for inpainting, respectively. A VRN is then trained to inpaint the incomplete videos. Meanwhile, the full video datasets are adopted to train another VRN frame-wisely and validate it against authoritative datasets. We show quantitative comparisons with several baseline models, achieving 40.5042 dB/0.99473 on PSNR/SSIM in the inpainting task, while during the SR task we obtained 28.41 dB/0.7953 and 27.25/0.8152 on BSD100 and Urban100, respectively. The qualitative comparisons demonstrate that our proposed model is able to complete masked regions and implement super-resolution reconstruction in videos of high quality. Furthermore, the above results show that our method has greater versatility both in video inpainting and super-resolution tasks compared to recent models.
first_indexed 2024-03-10T23:05:38Z
format Article
id doaj.art-948da5024aa941258535d9cc860c760d
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T23:05:38Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-948da5024aa941258535d9cc860c760d2023-11-19T09:21:38ZengMDPI AGApplied Sciences2076-34172023-09-0113181000110.3390/app131810001Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution TasksYuanfeng Zheng0Yuchen Yan1Hao Jiang2School of Electronic Information, Wuhan University, Wuhan 430072, ChinaSchool of Electronic Information, Wuhan University, Wuhan 430072, ChinaSchool of Electronic Information, Wuhan University, Wuhan 430072, ChinaAlthough deep learning-based approaches for video processing have been extensively investigated, the lack of generality in network construction makes it challenging for practical applications, particularly in video restoration. As a result, this paper presents a universal video restoration model that can simultaneously tackle video inpainting and super-resolution tasks. The network, called Video-Restoration-Net (VRN), consists of four components: (1) an encoder to extract features from each frame, (2) a non-local network that recombines features from adjacent frames or different locations of a given frame, (3) a decoder to restore the coarse video from the output of a non-local block, and (4) a refinement network to refine the coarse video on the frame level. The framework is trained in a three-step pipeline to improve training stability for both tasks. Specifically, we first suggest an automated technique to generate full video datasets for super-resolution reconstruction and another complete-incomplete video dataset for inpainting, respectively. A VRN is then trained to inpaint the incomplete videos. Meanwhile, the full video datasets are adopted to train another VRN frame-wisely and validate it against authoritative datasets. We show quantitative comparisons with several baseline models, achieving 40.5042 dB/0.99473 on PSNR/SSIM in the inpainting task, while during the SR task we obtained 28.41 dB/0.7953 and 27.25/0.8152 on BSD100 and Urban100, respectively. The qualitative comparisons demonstrate that our proposed model is able to complete masked regions and implement super-resolution reconstruction in videos of high quality. Furthermore, the above results show that our method has greater versatility both in video inpainting and super-resolution tasks compared to recent models.https://www.mdpi.com/2076-3417/13/18/10001video restorationvideo inpaintingsuper-resolutionnon-local
spellingShingle Yuanfeng Zheng
Yuchen Yan
Hao Jiang
Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
Applied Sciences
video restoration
video inpainting
super-resolution
non-local
title Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
title_full Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
title_fullStr Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
title_full_unstemmed Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
title_short Video-Restoration-Net: Deep Generative Model with Non-Local Network for Inpainting and Super-Resolution Tasks
title_sort video restoration net deep generative model with non local network for inpainting and super resolution tasks
topic video restoration
video inpainting
super-resolution
non-local
url https://www.mdpi.com/2076-3417/13/18/10001
work_keys_str_mv AT yuanfengzheng videorestorationnetdeepgenerativemodelwithnonlocalnetworkforinpaintingandsuperresolutiontasks
AT yuchenyan videorestorationnetdeepgenerativemodelwithnonlocalnetworkforinpaintingandsuperresolutiontasks
AT haojiang videorestorationnetdeepgenerativemodelwithnonlocalnetworkforinpaintingandsuperresolutiontasks