Answering Queries using Views over Probabilistic XML: Complexity and Tractability

We study the complexity of query answering using views in a probabilistic XML setting, identifying large classes of XPath queries -- with child and descendant navigation and predicates -- for which there are efficient (PTime) algorithms. We consider this problem under the two possible semantics for...

Ամբողջական նկարագրություն

Մատենագիտական մանրամասներ
Հիմնական հեղինակներ:	Cautis, B, Kharlamov, E
Ձևաչափ:	Journal article
Լեզու:	English
Հրապարակվել է:	2012

_version_	1826289876121681920
author	Cautis, B Kharlamov, E
author_facet	Cautis, B Kharlamov, E
author_sort	Cautis, B
collection	OXFORD
description	We study the complexity of query answering using views in a probabilistic XML setting, identifying large classes of XPath queries -- with child and descendant navigation and predicates -- for which there are efficient (PTime) algorithms. We consider this problem under the two possible semantics for XML query results: with persistent node identifiers and in their absence. Accordingly, we consider rewritings that can exploit a single view, by means of compensation, and rewritings that can use multiple views, by means of intersection. Since in a probabilistic setting queries return answers with probabilities, the problem of rewriting goes beyond the classic one of retrieving XML answers from views. For both semantics of XML queries, we show that, even when XML answers can be retrieved from views, their probabilities may not be computable. For rewritings that use only compensation, we describe a PTime decision procedure, based on easily verifiable criteria that distinguish between the feasible cases -- when probabilistic XML results are computable -- and the unfeasible ones. For rewritings that can use multiple views, with compensation and intersection, we identify the most permissive conditions that make probabilistic rewriting feasible, and we describe an algorithm that is sound in general, and becomes complete under fairly permissive restrictions, running in PTime modulo worst-case exponential time equivalence tests. This is the best we can hope for since intersection makes query equivalence intractable already over deterministic data. Our algorithm runs in PTime whenever deterministic rewritings can be found in PTime.
first_indexed	2024-03-07T02:35:35Z
format	Journal article
id	oxford-uuid:a8a9c76e-639c-4484-8836-1c97c7d82e86
institution	University of Oxford
language	English
last_indexed	2024-03-07T02:35:35Z
publishDate	2012
record_format	dspace
spelling	oxford-uuid:a8a9c76e-639c-4484-8836-1c97c7d82e862022-03-27T03:03:13ZAnswering Queries using Views over Probabilistic XML: Complexity and TractabilityJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:a8a9c76e-639c-4484-8836-1c97c7d82e86EnglishSymplectic Elements at Oxford2012Cautis, BKharlamov, EWe study the complexity of query answering using views in a probabilistic XML setting, identifying large classes of XPath queries -- with child and descendant navigation and predicates -- for which there are efficient (PTime) algorithms. We consider this problem under the two possible semantics for XML query results: with persistent node identifiers and in their absence. Accordingly, we consider rewritings that can exploit a single view, by means of compensation, and rewritings that can use multiple views, by means of intersection. Since in a probabilistic setting queries return answers with probabilities, the problem of rewriting goes beyond the classic one of retrieving XML answers from views. For both semantics of XML queries, we show that, even when XML answers can be retrieved from views, their probabilities may not be computable. For rewritings that use only compensation, we describe a PTime decision procedure, based on easily verifiable criteria that distinguish between the feasible cases -- when probabilistic XML results are computable -- and the unfeasible ones. For rewritings that can use multiple views, with compensation and intersection, we identify the most permissive conditions that make probabilistic rewriting feasible, and we describe an algorithm that is sound in general, and becomes complete under fairly permissive restrictions, running in PTime modulo worst-case exponential time equivalence tests. This is the best we can hope for since intersection makes query equivalence intractable already over deterministic data. Our algorithm runs in PTime whenever deterministic rewritings can be found in PTime.
spellingShingle	Cautis, B Kharlamov, E Answering Queries using Views over Probabilistic XML: Complexity and Tractability
title	Answering Queries using Views over Probabilistic XML: Complexity and Tractability
title_full	Answering Queries using Views over Probabilistic XML: Complexity and Tractability
title_fullStr	Answering Queries using Views over Probabilistic XML: Complexity and Tractability
title_full_unstemmed	Answering Queries using Views over Probabilistic XML: Complexity and Tractability
title_short	Answering Queries using Views over Probabilistic XML: Complexity and Tractability
title_sort	answering queries using views over probabilistic xml complexity and tractability
work_keys_str_mv	AT cautisb answeringqueriesusingviewsoverprobabilisticxmlcomplexityandtractability AT kharlamove answeringqueriesusingviewsoverprobabilisticxmlcomplexityandtractability

Answering Queries using Views over Probabilistic XML: Complexity and Tractability

Նմանատիպ նյութեր