Estimating missing values of skylines in incomplete database

Incompleteness of data is a common problem in many databases including web heterogonous databases, multirelational databases, spatial and temporal databases and data integration. The incompleteness of data introduces challenges in processing queries as providing accurate results that best meet the q...

Full description

Bibliographic Details
Main Authors: Alwan, Ali A., Ibrahim, Hamidah, Udzir, Nur Izura, Sidi, Fatimah
Format: Conference or Workshop Item
Language:English
Published: The Society of Digital Information and Wireless Communications (SDIWC) 2013
Online Access:http://psasir.upm.edu.my/id/eprint/41329/1/41329.pdf
_version_ 1796973915493367808
author Alwan, Ali A.
Ibrahim, Hamidah
Udzir, Nur Izura
Sidi, Fatimah
author_facet Alwan, Ali A.
Ibrahim, Hamidah
Udzir, Nur Izura
Sidi, Fatimah
author_sort Alwan, Ali A.
collection UPM
description Incompleteness of data is a common problem in many databases including web heterogonous databases, multirelational databases, spatial and temporal databases and data integration. The incompleteness of data introduces challenges in processing queries as providing accurate results that best meet the query conditions over incomplete database is not a trivial task. Several techniques have been proposed to process queries in incomplete database. Some of these techniques retrieve the query results based on the existing values rather than estimating the missing values. Such techniques are undesirable in many cases as the dimensions with missing values might be the important dimensions of the user’s query. Besides, the output is incomplete and might not satisfy the user preferences. In this paper we propose an approach that estimates missing values in skylines to guide users in selecting the most appropriate skylines from the several candidate skylines. The approach utilizes the concept of mining attribute correlations to generate an Approximate Functional Dependencies (AFDs) that captured the relationships between the dimensions. Besides, identifying the strength of probability correlations to estimate the values. Then, the skylines with estimated values are ranked. By doing so, we ensure that the retrieved skylines are in the order of their estimated precision.
first_indexed 2024-03-06T08:49:35Z
format Conference or Workshop Item
id upm.eprints-41329
institution Universiti Putra Malaysia
language English
last_indexed 2024-03-06T08:49:35Z
publishDate 2013
publisher The Society of Digital Information and Wireless Communications (SDIWC)
record_format dspace
spelling upm.eprints-413292015-11-04T04:23:50Z http://psasir.upm.edu.my/id/eprint/41329/ Estimating missing values of skylines in incomplete database Alwan, Ali A. Ibrahim, Hamidah Udzir, Nur Izura Sidi, Fatimah Incompleteness of data is a common problem in many databases including web heterogonous databases, multirelational databases, spatial and temporal databases and data integration. The incompleteness of data introduces challenges in processing queries as providing accurate results that best meet the query conditions over incomplete database is not a trivial task. Several techniques have been proposed to process queries in incomplete database. Some of these techniques retrieve the query results based on the existing values rather than estimating the missing values. Such techniques are undesirable in many cases as the dimensions with missing values might be the important dimensions of the user’s query. Besides, the output is incomplete and might not satisfy the user preferences. In this paper we propose an approach that estimates missing values in skylines to guide users in selecting the most appropriate skylines from the several candidate skylines. The approach utilizes the concept of mining attribute correlations to generate an Approximate Functional Dependencies (AFDs) that captured the relationships between the dimensions. Besides, identifying the strength of probability correlations to estimate the values. Then, the skylines with estimated values are ranked. By doing so, we ensure that the retrieved skylines are in the order of their estimated precision. The Society of Digital Information and Wireless Communications (SDIWC) 2013 Conference or Workshop Item NonPeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/41329/1/41329.pdf Alwan, Ali A. and Ibrahim, Hamidah and Udzir, Nur Izura and Sidi, Fatimah (2013) Estimating missing values of skylines in incomplete database. In: The Second International Conference on Digital Enterprise and Information Systems (DEIS 2013), 4-6 Mar. 2013, Kuala Lumpur, Malaysia. (pp. 220-229). http://sdiwc.net/digital-library/download.php?id=00000503.pdf
spellingShingle Alwan, Ali A.
Ibrahim, Hamidah
Udzir, Nur Izura
Sidi, Fatimah
Estimating missing values of skylines in incomplete database
title Estimating missing values of skylines in incomplete database
title_full Estimating missing values of skylines in incomplete database
title_fullStr Estimating missing values of skylines in incomplete database
title_full_unstemmed Estimating missing values of skylines in incomplete database
title_short Estimating missing values of skylines in incomplete database
title_sort estimating missing values of skylines in incomplete database
url http://psasir.upm.edu.my/id/eprint/41329/1/41329.pdf
work_keys_str_mv AT alwanalia estimatingmissingvaluesofskylinesinincompletedatabase
AT ibrahimhamidah estimatingmissingvaluesofskylinesinincompletedatabase
AT udzirnurizura estimatingmissingvaluesofskylinesinincompletedatabase
AT sidifatimah estimatingmissingvaluesofskylinesinincompletedatabase