Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications

Quality of Experience (QoE) in multi-view streaming systems is known to be severely affected by the latency associated with view-switching procedures. Anticipating the navigation intentions of the viewer on the multi-view scene could provide the means to greatly reduce such latency. The research wor...

Full description

Bibliographic Details
Main Authors: Tiago S. Costa, Paula Viana, Maria T. Andrade
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10235963/
_version_ 1797689565901750272
author Tiago S. Costa
Paula Viana
Maria T. Andrade
author_facet Tiago S. Costa
Paula Viana
Maria T. Andrade
author_sort Tiago S. Costa
collection DOAJ
description Quality of Experience (QoE) in multi-view streaming systems is known to be severely affected by the latency associated with view-switching procedures. Anticipating the navigation intentions of the viewer on the multi-view scene could provide the means to greatly reduce such latency. The research work presented in this article builds on this premise by proposing a new predictive view-selection mechanism. A VGG16-inspired Convolutional Neural Network (CNN) is used to identify the viewer’s focus of attention and determine which views would be most suited to be presented in the brief term, i.e., the near-term viewing intentions. This way, those views can be locally buffered before they are actually needed. To this aim, two datasets were used to evaluate the prediction performance and impact on latency, in particular when compared to the solution implemented in the previous version of our multi-view streaming system. Results obtained with this work translate into a generalized improvement in perceived QoE. A significant reduction in latency during view-switching procedures was effectively achieved. Moreover, results also demonstrated that the prediction of the user’s visual interest was achieved with a high level of accuracy. An experimental platform was also established on which future predictive models can be integrated and compared with previously implemented models.
first_indexed 2024-03-12T01:47:25Z
format Article
id doaj.art-66e21586208447d882511bf5ea60d884
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-12T01:47:25Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-66e21586208447d882511bf5ea60d8842023-09-08T23:01:32ZengIEEEIEEE Access2169-35362023-01-0111938839389710.1109/ACCESS.2023.331082210235963Deep Learning Approach for Seamless Navigation in Multi-View Streaming ApplicationsTiago S. Costa0https://orcid.org/0000-0002-3778-8773Paula Viana1https://orcid.org/0000-0001-8447-2360Maria T. Andrade2https://orcid.org/0000-0002-1363-5027Centre for Telecommunications and Multimedia, INESC TEC, Porto, PortugalCentre for Telecommunications and Multimedia, INESC TEC, Porto, PortugalCentre for Telecommunications and Multimedia, INESC TEC, Porto, PortugalQuality of Experience (QoE) in multi-view streaming systems is known to be severely affected by the latency associated with view-switching procedures. Anticipating the navigation intentions of the viewer on the multi-view scene could provide the means to greatly reduce such latency. The research work presented in this article builds on this premise by proposing a new predictive view-selection mechanism. A VGG16-inspired Convolutional Neural Network (CNN) is used to identify the viewer’s focus of attention and determine which views would be most suited to be presented in the brief term, i.e., the near-term viewing intentions. This way, those views can be locally buffered before they are actually needed. To this aim, two datasets were used to evaluate the prediction performance and impact on latency, in particular when compared to the solution implemented in the previous version of our multi-view streaming system. Results obtained with this work translate into a generalized improvement in perceived QoE. A significant reduction in latency during view-switching procedures was effectively achieved. Moreover, results also demonstrated that the prediction of the user’s visual interest was achieved with a high level of accuracy. An experimental platform was also established on which future predictive models can be integrated and compared with previously implemented models.https://ieeexplore.ieee.org/document/10235963/Multimediastreamingmulti-viewfocus-of-attentiondeep learning
spellingShingle Tiago S. Costa
Paula Viana
Maria T. Andrade
Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
IEEE Access
Multimedia
streaming
multi-view
focus-of-attention
deep learning
title Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
title_full Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
title_fullStr Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
title_full_unstemmed Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
title_short Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
title_sort deep learning approach for seamless navigation in multi view streaming applications
topic Multimedia
streaming
multi-view
focus-of-attention
deep learning
url https://ieeexplore.ieee.org/document/10235963/
work_keys_str_mv AT tiagoscosta deeplearningapproachforseamlessnavigationinmultiviewstreamingapplications
AT paulaviana deeplearningapproachforseamlessnavigationinmultiviewstreamingapplications
AT mariatandrade deeplearningapproachforseamlessnavigationinmultiviewstreamingapplications