Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware Inference

Video super-resolution aims to generate high-resolution frames from low-resolution counterparts. It can be regarded as a specialized application of image super-resolution, serving various purposes, such as video display and surveillance. This paper proposes a novel method for real-time video super-r...

Full description

Bibliographic Details
Main Authors: Wenhao Wang, Zhenbing Liu, Haoxiang Lu, Rushi Lan, Zhaoyuan Zhang
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/23/18/7880
_version_ 1797576989320675328
author Wenhao Wang
Zhenbing Liu
Haoxiang Lu
Rushi Lan
Zhaoyuan Zhang
author_facet Wenhao Wang
Zhenbing Liu
Haoxiang Lu
Rushi Lan
Zhaoyuan Zhang
author_sort Wenhao Wang
collection DOAJ
description Video super-resolution aims to generate high-resolution frames from low-resolution counterparts. It can be regarded as a specialized application of image super-resolution, serving various purposes, such as video display and surveillance. This paper proposes a novel method for real-time video super-resolution. It effectively exploits spatial information by utilizing the capabilities of an image super-resolution model and leverages the temporal information inherent in videos. Specifically, the method incorporates a pre-trained image super-resolution network as its foundational framework, allowing it to leverage existing expertise for super-resolution. A fast temporal information aggregation module is presented to further aggregate temporal cues across frames. By using deformable convolution to align features of neighboring frames, this module takes advantage of inter-frame dependency. In addition, it employs a hierarchical fast spatial offset feature extraction and a channel attention-based temporal fusion. A redundancy-aware inference algorithm is developed to reduce computational redundancy by reusing intermediate features, achieving real-time inferring speed. Extensive experiments on several benchmarks demonstrate that the proposed method can reconstruct satisfactory results with strong quantitative performance and visual qualities. The real-time inferring ability makes it suitable for real-world deployment.
first_indexed 2024-03-10T22:02:31Z
format Article
id doaj.art-987ece9b78ef43009f0aaef5e8e9ec63
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-10T22:02:31Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-987ece9b78ef43009f0aaef5e8e9ec632023-11-19T12:55:30ZengMDPI AGSensors1424-82202023-09-012318788010.3390/s23187880Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware InferenceWenhao Wang0Zhenbing Liu1Haoxiang Lu2Rushi Lan3Zhaoyuan Zhang4School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, ChinaSchool of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, ChinaSchool of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, ChinaSchool of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, ChinaSchool of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, ChinaVideo super-resolution aims to generate high-resolution frames from low-resolution counterparts. It can be regarded as a specialized application of image super-resolution, serving various purposes, such as video display and surveillance. This paper proposes a novel method for real-time video super-resolution. It effectively exploits spatial information by utilizing the capabilities of an image super-resolution model and leverages the temporal information inherent in videos. Specifically, the method incorporates a pre-trained image super-resolution network as its foundational framework, allowing it to leverage existing expertise for super-resolution. A fast temporal information aggregation module is presented to further aggregate temporal cues across frames. By using deformable convolution to align features of neighboring frames, this module takes advantage of inter-frame dependency. In addition, it employs a hierarchical fast spatial offset feature extraction and a channel attention-based temporal fusion. A redundancy-aware inference algorithm is developed to reduce computational redundancy by reusing intermediate features, achieving real-time inferring speed. Extensive experiments on several benchmarks demonstrate that the proposed method can reconstruct satisfactory results with strong quantitative performance and visual qualities. The real-time inferring ability makes it suitable for real-world deployment.https://www.mdpi.com/1424-8220/23/18/7880video super-resolutiontemporal aggregationdeformable convolutionredundancy-aware inferencedeep learning
spellingShingle Wenhao Wang
Zhenbing Liu
Haoxiang Lu
Rushi Lan
Zhaoyuan Zhang
Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware Inference
Sensors
video super-resolution
temporal aggregation
deformable convolution
redundancy-aware inference
deep learning
title Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware Inference
title_full Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware Inference
title_fullStr Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware Inference
title_full_unstemmed Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware Inference
title_short Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware Inference
title_sort real time video super resolution with spatio temporal modeling and redundancy aware inference
topic video super-resolution
temporal aggregation
deformable convolution
redundancy-aware inference
deep learning
url https://www.mdpi.com/1424-8220/23/18/7880
work_keys_str_mv AT wenhaowang realtimevideosuperresolutionwithspatiotemporalmodelingandredundancyawareinference
AT zhenbingliu realtimevideosuperresolutionwithspatiotemporalmodelingandredundancyawareinference
AT haoxianglu realtimevideosuperresolutionwithspatiotemporalmodelingandredundancyawareinference
AT rushilan realtimevideosuperresolutionwithspatiotemporalmodelingandredundancyawareinference
AT zhaoyuanzhang realtimevideosuperresolutionwithspatiotemporalmodelingandredundancyawareinference