Capturing continuous data and answering aggregate queries in probabilistic XML.

Sources of data uncertainty and imprecision are numerous. A way to handle this uncertainty is to associate probabilistic annotations to data. Many such probabilistic database models have been proposed, both in the relational and in the semi-structured setting. The latter is particularly well adapted...

Olles dieđut

Bibliográfalaš dieđut
Váldodahkkit: Abiteboul, S, Chan, T, Kharlamov, E, Nutt, W, Senellart, P
Materiálatiipa: Journal article
Giella:English
Almmustuhtton: 2011
_version_ 1826265482755309568
author Abiteboul, S
Chan, T
Kharlamov, E
Nutt, W
Senellart, P
author_facet Abiteboul, S
Chan, T
Kharlamov, E
Nutt, W
Senellart, P
author_sort Abiteboul, S
collection OXFORD
description Sources of data uncertainty and imprecision are numerous. A way to handle this uncertainty is to associate probabilistic annotations to data. Many such probabilistic database models have been proposed, both in the relational and in the semi-structured setting. The latter is particularly well adapted to the management of uncertain data coming from a variety of automatic processes. An important problem, in the context of probabilistic XML databases, is that of answering aggregate queries (count, sum, avg, etc.), which has received limited attention so far. In a model unifying the various (discrete) semi-structured probabilistic models studied up to now, we present algorithms to compute the distribution of the aggregation values (exploiting some regularity properties of the aggregate functions) and probabilistic moments (especially expectation and variance) of this distribution.We also prove the intractability of some of these problems and investigate approximation techniques. We finally extend the discrete model to a continuous one, in order to take into account continuous data values, such as measurements from sensor networks, and extend our algorithms and complexity results to the continuous case. © 2011 ACM.
first_indexed 2024-03-06T20:24:23Z
format Journal article
id oxford-uuid:2eea85db-e8c0-45cb-b092-e3d5c0d853f9
institution University of Oxford
language English
last_indexed 2024-03-06T20:24:23Z
publishDate 2011
record_format dspace
spelling oxford-uuid:2eea85db-e8c0-45cb-b092-e3d5c0d853f92022-03-26T12:51:50ZCapturing continuous data and answering aggregate queries in probabilistic XML.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:2eea85db-e8c0-45cb-b092-e3d5c0d853f9EnglishSymplectic Elements at Oxford2011Abiteboul, SChan, TKharlamov, ENutt, WSenellart, PSources of data uncertainty and imprecision are numerous. A way to handle this uncertainty is to associate probabilistic annotations to data. Many such probabilistic database models have been proposed, both in the relational and in the semi-structured setting. The latter is particularly well adapted to the management of uncertain data coming from a variety of automatic processes. An important problem, in the context of probabilistic XML databases, is that of answering aggregate queries (count, sum, avg, etc.), which has received limited attention so far. In a model unifying the various (discrete) semi-structured probabilistic models studied up to now, we present algorithms to compute the distribution of the aggregation values (exploiting some regularity properties of the aggregate functions) and probabilistic moments (especially expectation and variance) of this distribution.We also prove the intractability of some of these problems and investigate approximation techniques. We finally extend the discrete model to a continuous one, in order to take into account continuous data values, such as measurements from sensor networks, and extend our algorithms and complexity results to the continuous case. © 2011 ACM.
spellingShingle Abiteboul, S
Chan, T
Kharlamov, E
Nutt, W
Senellart, P
Capturing continuous data and answering aggregate queries in probabilistic XML.
title Capturing continuous data and answering aggregate queries in probabilistic XML.
title_full Capturing continuous data and answering aggregate queries in probabilistic XML.
title_fullStr Capturing continuous data and answering aggregate queries in probabilistic XML.
title_full_unstemmed Capturing continuous data and answering aggregate queries in probabilistic XML.
title_short Capturing continuous data and answering aggregate queries in probabilistic XML.
title_sort capturing continuous data and answering aggregate queries in probabilistic xml
work_keys_str_mv AT abitebouls capturingcontinuousdataandansweringaggregatequeriesinprobabilisticxml
AT chant capturingcontinuousdataandansweringaggregatequeriesinprobabilisticxml
AT kharlamove capturingcontinuousdataandansweringaggregatequeriesinprobabilisticxml
AT nuttw capturingcontinuousdataandansweringaggregatequeriesinprobabilisticxml
AT senellartp capturingcontinuousdataandansweringaggregatequeriesinprobabilisticxml