Lenslet Image Coding With SAIs Synthesis via 3D CNNs-Based Reinforcement Learning With a Rate Reward

The deep learning-based coding schemes for lenslet images combine coding standards and view synthesis through Deep Learning (DL) models, where the compression efficiency is heavily influenced by the coding structure and quality of synthesized views. To exploit the inter-view redundancy among Sub-Ape...

Full description

Bibliographic Details
Main Authors: Xiaoda Zhong, Tao Lu, Diyang Xiao, Rui Zhong
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10151925/
_version_ 1797794173100752896
author Xiaoda Zhong
Tao Lu
Diyang Xiao
Rui Zhong
author_facet Xiaoda Zhong
Tao Lu
Diyang Xiao
Rui Zhong
author_sort Xiaoda Zhong
collection DOAJ
description The deep learning-based coding schemes for lenslet images combine coding standards and view synthesis through Deep Learning (DL) models, where the compression efficiency is heavily influenced by the coding structure and quality of synthesized views. To exploit the inter-view redundancy among Sub-Aperture Images (SAIs), this paper proposes a hybrid closed-loop coding system that uses a novel coding structure based on checkerboard interleaving at a frame level. The frame-wise checkerboard interleaving method partitions an Original SAIs’ Set (OSS) of images into two mutually exclusive subsets, each consisting of alternating rows and columns of SAIs. We utilize the video coding standard Versatile Video Coding (VVC) to encode one subset while proposing a novel rate constraint-reinforced 3D Convolutional Neural Networks (CNNs) to predict the other subset, referred to as the complement subset. The rate constraint-reinforced 3D CNNs is newly designed with a gradient loss and reinforced rate cost to improve synthesized SAIs’ image quality and bit cost saving simultaneously. Experimental results on the light field image dataset demonstrate that the proposed hybrid coding system outperforms both HEVC_LDP and the previous state-of-the-art (SOTA), achieving an average BD-Bitrate savings of 41.58% and 23.31%, respectively.
first_indexed 2024-03-13T02:57:57Z
format Article
id doaj.art-c1c47d88044544a08383291f789354d2
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-13T02:57:57Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-c1c47d88044544a08383291f789354d22023-06-27T23:00:30ZengIEEEIEEE Access2169-35362023-01-0111624626247110.1109/ACCESS.2023.328629810151925Lenslet Image Coding With SAIs Synthesis via 3D CNNs-Based Reinforcement Learning With a Rate RewardXiaoda Zhong0https://orcid.org/0000-0002-4122-0670Tao Lu1Diyang Xiao2https://orcid.org/0009-0006-8650-4497Rui Zhong3Shien-Ming Wu School of Intelligent Engineering, South China University of Technology, Guangzhou, ChinaHubei Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan, ChinaSchool of Computer Science, Central China Normal University, Wuhan, ChinaSchool of Computer Science, Central China Normal University, Wuhan, ChinaThe deep learning-based coding schemes for lenslet images combine coding standards and view synthesis through Deep Learning (DL) models, where the compression efficiency is heavily influenced by the coding structure and quality of synthesized views. To exploit the inter-view redundancy among Sub-Aperture Images (SAIs), this paper proposes a hybrid closed-loop coding system that uses a novel coding structure based on checkerboard interleaving at a frame level. The frame-wise checkerboard interleaving method partitions an Original SAIs’ Set (OSS) of images into two mutually exclusive subsets, each consisting of alternating rows and columns of SAIs. We utilize the video coding standard Versatile Video Coding (VVC) to encode one subset while proposing a novel rate constraint-reinforced 3D Convolutional Neural Networks (CNNs) to predict the other subset, referred to as the complement subset. The rate constraint-reinforced 3D CNNs is newly designed with a gradient loss and reinforced rate cost to improve synthesized SAIs’ image quality and bit cost saving simultaneously. Experimental results on the light field image dataset demonstrate that the proposed hybrid coding system outperforms both HEVC_LDP and the previous state-of-the-art (SOTA), achieving an average BD-Bitrate savings of 41.58% and 23.31%, respectively.https://ieeexplore.ieee.org/document/10151925/Lenslet imagecompressionreinforcement learning3D CNNsVVC
spellingShingle Xiaoda Zhong
Tao Lu
Diyang Xiao
Rui Zhong
Lenslet Image Coding With SAIs Synthesis via 3D CNNs-Based Reinforcement Learning With a Rate Reward
IEEE Access
Lenslet image
compression
reinforcement learning
3D CNNs
VVC
title Lenslet Image Coding With SAIs Synthesis via 3D CNNs-Based Reinforcement Learning With a Rate Reward
title_full Lenslet Image Coding With SAIs Synthesis via 3D CNNs-Based Reinforcement Learning With a Rate Reward
title_fullStr Lenslet Image Coding With SAIs Synthesis via 3D CNNs-Based Reinforcement Learning With a Rate Reward
title_full_unstemmed Lenslet Image Coding With SAIs Synthesis via 3D CNNs-Based Reinforcement Learning With a Rate Reward
title_short Lenslet Image Coding With SAIs Synthesis via 3D CNNs-Based Reinforcement Learning With a Rate Reward
title_sort lenslet image coding with sais synthesis via 3d cnns based reinforcement learning with a rate reward
topic Lenslet image
compression
reinforcement learning
3D CNNs
VVC
url https://ieeexplore.ieee.org/document/10151925/
work_keys_str_mv AT xiaodazhong lensletimagecodingwithsaissynthesisvia3dcnnsbasedreinforcementlearningwitharatereward
AT taolu lensletimagecodingwithsaissynthesisvia3dcnnsbasedreinforcementlearningwitharatereward
AT diyangxiao lensletimagecodingwithsaissynthesisvia3dcnnsbasedreinforcementlearningwitharatereward
AT ruizhong lensletimagecodingwithsaissynthesisvia3dcnnsbasedreinforcementlearningwitharatereward