AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications

The paper presents <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula>, a versatile, bitstream-based video quality model. It can be applied in several contexts such as video service monitoring, evaluation of video encoding quality, of...

Full description

Bibliographic Details
Main Authors:	Rakesh Rao Ramachandra Rao, Steve Goring, Alexander Raake
Format:	Article
Language:	English
Published:	IEEE 2022-01-01
Series:	IEEE Access
Subjects:	Bitstream video quality models quality of experience (QoE) quality assessment HTTP-based adaptive streaming (HAS) hybrid models video quality
Online Access:	https://ieeexplore.ieee.org/document/9846967/

_version_	1798040481250148352
author	Rakesh Rao Ramachandra Rao Steve Goring Alexander Raake
author_facet	Rakesh Rao Ramachandra Rao Steve Goring Alexander Raake
author_sort	Rakesh Rao Ramachandra Rao
collection	DOAJ
description	The paper presents <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula>, a versatile, bitstream-based video quality model. It can be applied in several contexts such as video service monitoring, evaluation of video encoding quality, of gaming video QoE, and even of omnidirectional video quality. In the paper, it is shown that <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> predictions closely match video quality ratings obained in various subjective tests with human viewers, for videos up to 4K-UHD resolution (Ultra-High Definition, 3840 x 2180 pixels) and framerates up 120 fps. With the different variants of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> presented in the paper, video quality can be monitored either at the client side, in the network or directly after encoding. The no-reference <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> model was developed for different video services and types of input data, reflecting the increasing popularity of Video-on-Demand services and widespread use of HTTP-based adaptive streaming. At its core, <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> encompasses the standardized ITU-T P.1204.3 model, with further model instances that can either have restricted or extended input information, depending on the application context. Four different instances of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> are presented, that is, a Mode 3 model with full access to the bitstream, a Mode 0 variant using only metadata such as codec type, framerate, resoution and bitrate as input, a Mode 1 model using Mode 0 information and frame-type and -size information, and a Hybrid Mode 0 model that is based on Mode 0 metadata and the decoded video pixel information. The models are trained on the authors’ own AVT-PNATS-UHD-1 dataset described in the paper. All models show a highly competitive performance by using AVT-VQDB-UHD-1 as validation dataset, e.g., with the Mode 0 variant yielding a value of 0.890 Pearson Correlation, the Mode 1 model of 0.901, the hybrid no-reference mode 0 model of 0.928 and the model with full bitstream access of 0.942. In addition, all four <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> variants are evaluated when applying them out-of-the-box to different media formats such as 360° video, high framerate (HFR) content, or gaming videos. The analysis shows that the ITU-T P.1204.3 and Hybrid Mode 0 instances of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> for the considered use-cases either perform on par with or better than even state-of-the-art full reference, pixel-based models. Furthermore, it is shown that the proposed Mode 0 and Mode 1 variants outperform commonly used no-reference models for the different application scopes. Also, a long-term integration model based on the standardized ITU-T P.1203.3 is presented to estimate ratings of overall audiovisual streaming Quality of Experience (QoE) for sessions of 30 s up to 5 min duration. In the paper, the <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> instances with their per-1-sec score output are evaluated as the video quality component of the proposed long-term integration model. All <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> variants as well as the long-term integration module are made publicly available for the community for further research.
first_indexed	2024-04-11T22:08:12Z
format	Article
id	doaj.art-94f7723e375341d18febadd166e4a27c
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-04-11T22:08:12Z
publishDate	2022-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-94f7723e375341d18febadd166e4a27c2022-12-22T04:00:38ZengIEEEIEEE Access2169-35362022-01-0110803218035110.1109/ACCESS.2022.31955279846967AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video ApplicationsRakesh Rao Ramachandra Rao0https://orcid.org/0000-0002-7069-1543Steve Goring1https://orcid.org/0000-0001-6810-6969Alexander Raake2https://orcid.org/0000-0002-9357-1763Audiovisual Technology Group, Technische Universität Ilmenau, Ilmenau, GermanyAudiovisual Technology Group, Technische Universität Ilmenau, Ilmenau, GermanyAudiovisual Technology Group, Technische Universität Ilmenau, Ilmenau, GermanyThe paper presents <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula>, a versatile, bitstream-based video quality model. It can be applied in several contexts such as video service monitoring, evaluation of video encoding quality, of gaming video QoE, and even of omnidirectional video quality. In the paper, it is shown that <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> predictions closely match video quality ratings obained in various subjective tests with human viewers, for videos up to 4K-UHD resolution (Ultra-High Definition, 3840 x 2180 pixels) and framerates up 120 fps. With the different variants of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> presented in the paper, video quality can be monitored either at the client side, in the network or directly after encoding. The no-reference <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> model was developed for different video services and types of input data, reflecting the increasing popularity of Video-on-Demand services and widespread use of HTTP-based adaptive streaming. At its core, <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> encompasses the standardized ITU-T P.1204.3 model, with further model instances that can either have restricted or extended input information, depending on the application context. Four different instances of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> are presented, that is, a Mode 3 model with full access to the bitstream, a Mode 0 variant using only metadata such as codec type, framerate, resoution and bitrate as input, a Mode 1 model using Mode 0 information and frame-type and -size information, and a Hybrid Mode 0 model that is based on Mode 0 metadata and the decoded video pixel information. The models are trained on the authors’ own AVT-PNATS-UHD-1 dataset described in the paper. All models show a highly competitive performance by using AVT-VQDB-UHD-1 as validation dataset, e.g., with the Mode 0 variant yielding a value of 0.890 Pearson Correlation, the Mode 1 model of 0.901, the hybrid no-reference mode 0 model of 0.928 and the model with full bitstream access of 0.942. In addition, all four <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> variants are evaluated when applying them out-of-the-box to different media formats such as 360° video, high framerate (HFR) content, or gaming videos. The analysis shows that the ITU-T P.1204.3 and Hybrid Mode 0 instances of <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> for the considered use-cases either perform on par with or better than even state-of-the-art full reference, pixel-based models. Furthermore, it is shown that the proposed Mode 0 and Mode 1 variants outperform commonly used no-reference models for the different application scopes. Also, a long-term integration model based on the standardized ITU-T P.1203.3 is presented to estimate ratings of overall audiovisual streaming Quality of Experience (QoE) for sessions of 30 s up to 5 min duration. In the paper, the <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> instances with their per-1-sec score output are evaluated as the video quality component of the proposed long-term integration model. All <inline-formula> <tex-math notation="LaTeX">$AVQBits$ </tex-math></inline-formula> variants as well as the long-term integration module are made publicly available for the community for further research.https://ieeexplore.ieee.org/document/9846967/Bitstream video quality modelsquality of experience (QoE)quality assessmentHTTP-based adaptive streaming (HAS)hybrid modelsvideo quality
spellingShingle	Rakesh Rao Ramachandra Rao Steve Goring Alexander Raake AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications IEEE Access Bitstream video quality models quality of experience (QoE) quality assessment HTTP-based adaptive streaming (HAS) hybrid models video quality
title	AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
title_full	AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
title_fullStr	AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
title_full_unstemmed	AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
title_short	AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
title_sort	avqbits x2014 adaptive video quality model based on bitstream information for various video applications
topic	Bitstream video quality models quality of experience (QoE) quality assessment HTTP-based adaptive streaming (HAS) hybrid models video quality
url	https://ieeexplore.ieee.org/document/9846967/
work_keys_str_mv	AT rakeshraoramachandrarao avqbitsx2014adaptivevideoqualitymodelbasedonbitstreaminformationforvariousvideoapplications AT stevegoring avqbitsx2014adaptivevideoqualitymodelbasedonbitstreaminformationforvariousvideoapplications AT alexanderraake avqbitsx2014adaptivevideoqualitymodelbasedonbitstreaminformationforvariousvideoapplications

AVQBits&#x2014;Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications

Similar Items

AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications