AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications

The paper presents <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula>, a versatile, bitstream-based video quality model. It can be applied in several contexts such as video service monitoring, evaluation of video encoding quality, of...

Full description

Bibliographic Details
Main Authors: Rakesh Rao Ramachandra Rao, Steve Goring, Alexander Raake
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9846967/
_version_ 1798040481250148352
author Rakesh Rao Ramachandra Rao
Steve Goring
Alexander Raake
author_facet Rakesh Rao Ramachandra Rao
Steve Goring
Alexander Raake
author_sort Rakesh Rao Ramachandra Rao
collection DOAJ
description The paper presents <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula>, a versatile, bitstream-based video quality model. It can be applied in several contexts such as video service monitoring, evaluation of video encoding quality, of gaming video QoE, and even of omnidirectional video quality. In the paper, it is shown that <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> predictions closely match video quality ratings obained in various subjective tests with human viewers, for videos up to 4K-UHD resolution (Ultra-High Definition, 3840 x 2180 pixels) and framerates up 120 fps. With the different variants of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> presented in the paper, video quality can be monitored either at the client side, in the network or directly after encoding. The no-reference <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> model was developed for different video services and types of input data, reflecting the increasing popularity of Video-on-Demand services and widespread use of HTTP-based adaptive streaming. At its core, <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> encompasses the standardized ITU-T P.1204.3 model, with further model instances that can either have restricted or extended input information, depending on the application context. Four different instances of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> are presented, that is, a Mode 3 model with full access to the bitstream, a Mode 0 variant using only metadata such as codec type, framerate, resoution and bitrate as input, a Mode 1 model using Mode 0 information and frame-type and -size information, and a Hybrid Mode 0 model that is based on Mode 0 metadata and the decoded video pixel information. The models are trained on the authors&#x2019; own AVT-PNATS-UHD-1 dataset described in the paper. All models show a highly competitive performance by using AVT-VQDB-UHD-1 as validation dataset, e.g., with the Mode 0 variant yielding a value of 0.890 Pearson Correlation, the Mode 1 model of 0.901, the hybrid no-reference mode 0 model of 0.928 and the model with full bitstream access of 0.942. In addition, all four <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> variants are evaluated when applying them out-of-the-box to different media formats such as 360&#x00B0; video, high framerate (HFR) content, or gaming videos. The analysis shows that the ITU-T P.1204.3 and Hybrid Mode 0 instances of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> for the considered use-cases either perform on par with or better than even state-of-the-art full reference, pixel-based models. Furthermore, it is shown that the proposed Mode 0 and Mode 1 variants outperform commonly used no-reference models for the different application scopes. Also, a long-term integration model based on the standardized ITU-T P.1203.3 is presented to estimate ratings of overall audiovisual streaming Quality of Experience (QoE) for sessions of 30 s up to 5 min duration. In the paper, the <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> instances with their per-1-sec score output are evaluated as the video quality component of the proposed long-term integration model. All <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> variants as well as the long-term integration module are made publicly available for the community for further research.
first_indexed 2024-04-11T22:08:12Z
format Article
id doaj.art-94f7723e375341d18febadd166e4a27c
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-11T22:08:12Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-94f7723e375341d18febadd166e4a27c2022-12-22T04:00:38ZengIEEEIEEE Access2169-35362022-01-0110803218035110.1109/ACCESS.2022.31955279846967AVQBits&#x2014;Adaptive Video Quality Model Based on Bitstream Information for Various Video ApplicationsRakesh Rao Ramachandra Rao0https://orcid.org/0000-0002-7069-1543Steve Goring1https://orcid.org/0000-0001-6810-6969Alexander Raake2https://orcid.org/0000-0002-9357-1763Audiovisual Technology Group, Technische Universit&#x00E4;t Ilmenau, Ilmenau, GermanyAudiovisual Technology Group, Technische Universit&#x00E4;t Ilmenau, Ilmenau, GermanyAudiovisual Technology Group, Technische Universit&#x00E4;t Ilmenau, Ilmenau, GermanyThe paper presents <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula>, a versatile, bitstream-based video quality model. It can be applied in several contexts such as video service monitoring, evaluation of video encoding quality, of gaming video QoE, and even of omnidirectional video quality. In the paper, it is shown that <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> predictions closely match video quality ratings obained in various subjective tests with human viewers, for videos up to 4K-UHD resolution (Ultra-High Definition, 3840 x 2180 pixels) and framerates up 120 fps. With the different variants of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> presented in the paper, video quality can be monitored either at the client side, in the network or directly after encoding. The no-reference <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> model was developed for different video services and types of input data, reflecting the increasing popularity of Video-on-Demand services and widespread use of HTTP-based adaptive streaming. At its core, <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> encompasses the standardized ITU-T P.1204.3 model, with further model instances that can either have restricted or extended input information, depending on the application context. Four different instances of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> are presented, that is, a Mode 3 model with full access to the bitstream, a Mode 0 variant using only metadata such as codec type, framerate, resoution and bitrate as input, a Mode 1 model using Mode 0 information and frame-type and -size information, and a Hybrid Mode 0 model that is based on Mode 0 metadata and the decoded video pixel information. The models are trained on the authors&#x2019; own AVT-PNATS-UHD-1 dataset described in the paper. All models show a highly competitive performance by using AVT-VQDB-UHD-1 as validation dataset, e.g., with the Mode 0 variant yielding a value of 0.890 Pearson Correlation, the Mode 1 model of 0.901, the hybrid no-reference mode 0 model of 0.928 and the model with full bitstream access of 0.942. In addition, all four <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> variants are evaluated when applying them out-of-the-box to different media formats such as 360&#x00B0; video, high framerate (HFR) content, or gaming videos. The analysis shows that the ITU-T P.1204.3 and Hybrid Mode 0 instances of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> for the considered use-cases either perform on par with or better than even state-of-the-art full reference, pixel-based models. Furthermore, it is shown that the proposed Mode 0 and Mode 1 variants outperform commonly used no-reference models for the different application scopes. Also, a long-term integration model based on the standardized ITU-T P.1203.3 is presented to estimate ratings of overall audiovisual streaming Quality of Experience (QoE) for sessions of 30 s up to 5 min duration. In the paper, the <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> instances with their per-1-sec score output are evaluated as the video quality component of the proposed long-term integration model. All <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> variants as well as the long-term integration module are made publicly available for the community for further research.https://ieeexplore.ieee.org/document/9846967/Bitstream video quality modelsquality of experience (QoE)quality assessmentHTTP-based adaptive streaming (HAS)hybrid modelsvideo quality
spellingShingle Rakesh Rao Ramachandra Rao
Steve Goring
Alexander Raake
AVQBits&#x2014;Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
IEEE Access
Bitstream video quality models
quality of experience (QoE)
quality assessment
HTTP-based adaptive streaming (HAS)
hybrid models
video quality
title AVQBits&#x2014;Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
title_full AVQBits&#x2014;Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
title_fullStr AVQBits&#x2014;Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
title_full_unstemmed AVQBits&#x2014;Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
title_short AVQBits&#x2014;Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
title_sort avqbits x2014 adaptive video quality model based on bitstream information for various video applications
topic Bitstream video quality models
quality of experience (QoE)
quality assessment
HTTP-based adaptive streaming (HAS)
hybrid models
video quality
url https://ieeexplore.ieee.org/document/9846967/
work_keys_str_mv AT rakeshraoramachandrarao avqbitsx2014adaptivevideoqualitymodelbasedonbitstreaminformationforvariousvideoapplications
AT stevegoring avqbitsx2014adaptivevideoqualitymodelbasedonbitstreaminformationforvariousvideoapplications
AT alexanderraake avqbitsx2014adaptivevideoqualitymodelbasedonbitstreaminformationforvariousvideoapplications