Fast CU Decision Algorithm Based on CNN and Decision Trees for VVC

Compared with the previous generation of High Efficiency Video Coding (HEVC), Versatile Video Coding (VVC) introduces a quadtree and multi-type tree (QTMT) partition structure with nested multi-class trees so that the coding unit (CU) partition can better match the video texture features. This parti...

Full description

Bibliographic Details
Main Authors: Hongchan Li, Peng Zhang, Baohua Jin, Qiuwen Zhang
Format: Article
Language:English
Published: MDPI AG 2023-07-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/12/14/3053
_version_ 1797589539373449216
author Hongchan Li
Peng Zhang
Baohua Jin
Qiuwen Zhang
author_facet Hongchan Li
Peng Zhang
Baohua Jin
Qiuwen Zhang
author_sort Hongchan Li
collection DOAJ
description Compared with the previous generation of High Efficiency Video Coding (HEVC), Versatile Video Coding (VVC) introduces a quadtree and multi-type tree (QTMT) partition structure with nested multi-class trees so that the coding unit (CU) partition can better match the video texture features. This partition structure makes the compression efficiency of VVC significantly improved, but the computational complexity is also significantly increased, resulting in an increase in encoding time. Therefore, we propose a fast CU partition decision algorithm based on DenseNet network and decision tree (DT) classifier to reduce the coding complexity of VVC and save more coding time. We extract spatial feature vectors based on the DenseNet network model. Spatial feature vectors are constructed by predicting the boundary probabilities of 4 × 4 blocks in 64 × 64 coding units. Then, using the spatial features as the input of the DT classifier, through the classification function of the DT classifier model, the top N division modes with higher prediction probability are selected, and other division modes are skipped to reduce the computational complexity. Finally, the optimal partition mode is selected by comparing the RD cost. Our proposed algorithm achieves 47.6% encoding time savings on VTM10.0, while BDBR only increases by 0.91%.
first_indexed 2024-03-11T01:07:54Z
format Article
id doaj.art-97e9ac6b50f04747af65149a89b88a92
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-11T01:07:54Z
publishDate 2023-07-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-97e9ac6b50f04747af65149a89b88a922023-11-18T19:05:09ZengMDPI AGElectronics2079-92922023-07-011214305310.3390/electronics12143053Fast CU Decision Algorithm Based on CNN and Decision Trees for VVCHongchan Li0Peng Zhang1Baohua Jin2Qiuwen Zhang3College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaCollege of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaCollege of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaCollege of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaCompared with the previous generation of High Efficiency Video Coding (HEVC), Versatile Video Coding (VVC) introduces a quadtree and multi-type tree (QTMT) partition structure with nested multi-class trees so that the coding unit (CU) partition can better match the video texture features. This partition structure makes the compression efficiency of VVC significantly improved, but the computational complexity is also significantly increased, resulting in an increase in encoding time. Therefore, we propose a fast CU partition decision algorithm based on DenseNet network and decision tree (DT) classifier to reduce the coding complexity of VVC and save more coding time. We extract spatial feature vectors based on the DenseNet network model. Spatial feature vectors are constructed by predicting the boundary probabilities of 4 × 4 blocks in 64 × 64 coding units. Then, using the spatial features as the input of the DT classifier, through the classification function of the DT classifier model, the top N division modes with higher prediction probability are selected, and other division modes are skipped to reduce the computational complexity. Finally, the optimal partition mode is selected by comparing the RD cost. Our proposed algorithm achieves 47.6% encoding time savings on VTM10.0, while BDBR only increases by 0.91%.https://www.mdpi.com/2079-9292/12/14/3053versatile video codingdecision treesQTMTDenseNet
spellingShingle Hongchan Li
Peng Zhang
Baohua Jin
Qiuwen Zhang
Fast CU Decision Algorithm Based on CNN and Decision Trees for VVC
Electronics
versatile video coding
decision trees
QTMT
DenseNet
title Fast CU Decision Algorithm Based on CNN and Decision Trees for VVC
title_full Fast CU Decision Algorithm Based on CNN and Decision Trees for VVC
title_fullStr Fast CU Decision Algorithm Based on CNN and Decision Trees for VVC
title_full_unstemmed Fast CU Decision Algorithm Based on CNN and Decision Trees for VVC
title_short Fast CU Decision Algorithm Based on CNN and Decision Trees for VVC
title_sort fast cu decision algorithm based on cnn and decision trees for vvc
topic versatile video coding
decision trees
QTMT
DenseNet
url https://www.mdpi.com/2079-9292/12/14/3053
work_keys_str_mv AT hongchanli fastcudecisionalgorithmbasedoncnnanddecisiontreesforvvc
AT pengzhang fastcudecisionalgorithmbasedoncnnanddecisiontreesforvvc
AT baohuajin fastcudecisionalgorithmbasedoncnnanddecisiontreesforvvc
AT qiuwenzhang fastcudecisionalgorithmbasedoncnnanddecisiontreesforvvc