Attention Networks for the Quality Enhancement of Light Field Images

In this paper, we propose a novel filtering method based on deep attention networks for the quality enhancement of light field (LF) images captured by plenoptic cameras and compressed using the High Efficiency Video Coding (HEVC) standard. The proposed architecture was built using efficient complex...

Full description

Bibliographic Details
Main Authors: Ionut Schiopu, Adrian Munteanu
Format: Article
Language:English
Published: MDPI AG 2021-05-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/21/9/3246
_version_ 1797534962347409408
author Ionut Schiopu
Adrian Munteanu
author_facet Ionut Schiopu
Adrian Munteanu
author_sort Ionut Schiopu
collection DOAJ
description In this paper, we propose a novel filtering method based on deep attention networks for the quality enhancement of light field (LF) images captured by plenoptic cameras and compressed using the High Efficiency Video Coding (HEVC) standard. The proposed architecture was built using efficient complex processing blocks and novel attention-based residual blocks. The network takes advantage of the macro-pixel (MP) structure, specific to LF images, and processes each reconstructed MP in the luminance (Y) channel. The input patch is represented as a tensor that collects, from an MP neighbourhood, four Epipolar Plane Images (EPIs) at four different angles. The experimental results on a common LF image database showed high improvements over HEVC in terms of the structural similarity index (SSIM), with an average Y-Bjøntegaard Delta (BD)-rate savings of <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>36.57</mn><mo>%</mo><mo>,</mo></mrow></semantics></math></inline-formula> and an average Y-BD-PSNR improvement of <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2.301</mn></mrow></semantics></math></inline-formula> dB. Increased performance was achieved when the HEVC built-in filtering methods were skipped. The visual results illustrate that the enhanced image contains sharper edges and more texture details. The ablation study provides two robust solutions to reduce the inference time by <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>44.6</mn><mo>%</mo></mrow></semantics></math></inline-formula> and the network complexity by <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>74.7</mn><mo>%</mo></mrow></semantics></math></inline-formula>. The results demonstrate the potential of attention networks for the quality enhancement of LF images encoded by HEVC.
first_indexed 2024-03-10T11:37:56Z
format Article
id doaj.art-d4aa81f7c58a4b858f473f8a4e2c2531
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-10T11:37:56Z
publishDate 2021-05-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-d4aa81f7c58a4b858f473f8a4e2c25312023-11-21T18:44:11ZengMDPI AGSensors1424-82202021-05-01219324610.3390/s21093246Attention Networks for the Quality Enhancement of Light Field ImagesIonut Schiopu0Adrian Munteanu1Department of Electronics and Informatics (ETRO), Vrije Universiteit Brussel (VUB), Pleinlaan 2, 1050 Brussels, BelgiumDepartment of Electronics and Informatics (ETRO), Vrije Universiteit Brussel (VUB), Pleinlaan 2, 1050 Brussels, BelgiumIn this paper, we propose a novel filtering method based on deep attention networks for the quality enhancement of light field (LF) images captured by plenoptic cameras and compressed using the High Efficiency Video Coding (HEVC) standard. The proposed architecture was built using efficient complex processing blocks and novel attention-based residual blocks. The network takes advantage of the macro-pixel (MP) structure, specific to LF images, and processes each reconstructed MP in the luminance (Y) channel. The input patch is represented as a tensor that collects, from an MP neighbourhood, four Epipolar Plane Images (EPIs) at four different angles. The experimental results on a common LF image database showed high improvements over HEVC in terms of the structural similarity index (SSIM), with an average Y-Bjøntegaard Delta (BD)-rate savings of <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>36.57</mn><mo>%</mo><mo>,</mo></mrow></semantics></math></inline-formula> and an average Y-BD-PSNR improvement of <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2.301</mn></mrow></semantics></math></inline-formula> dB. Increased performance was achieved when the HEVC built-in filtering methods were skipped. The visual results illustrate that the enhanced image contains sharper edges and more texture details. The ablation study provides two robust solutions to reduce the inference time by <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>44.6</mn><mo>%</mo></mrow></semantics></math></inline-formula> and the network complexity by <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>74.7</mn><mo>%</mo></mrow></semantics></math></inline-formula>. The results demonstrate the potential of attention networks for the quality enhancement of LF images encoded by HEVC.https://www.mdpi.com/1424-8220/21/9/3246attention networkquality enhancementlight field imagesvideo coding
spellingShingle Ionut Schiopu
Adrian Munteanu
Attention Networks for the Quality Enhancement of Light Field Images
Sensors
attention network
quality enhancement
light field images
video coding
title Attention Networks for the Quality Enhancement of Light Field Images
title_full Attention Networks for the Quality Enhancement of Light Field Images
title_fullStr Attention Networks for the Quality Enhancement of Light Field Images
title_full_unstemmed Attention Networks for the Quality Enhancement of Light Field Images
title_short Attention Networks for the Quality Enhancement of Light Field Images
title_sort attention networks for the quality enhancement of light field images
topic attention network
quality enhancement
light field images
video coding
url https://www.mdpi.com/1424-8220/21/9/3246
work_keys_str_mv AT ionutschiopu attentionnetworksforthequalityenhancementoflightfieldimages
AT adrianmunteanu attentionnetworksforthequalityenhancementoflightfieldimages