A model for processing skyline queries over a database with missing data

Skyline queries provide a flexible query operator that returns the data items (skylines) that are not dominated by any other data item across all dimensions (attributes) of the database. Most existing skyline techniques determine the skylines by assuming that the values of dimensions for every da...


Bibliographic Details
Main Authors: Alwan, Ali Amer, Ibrahim, Hamidah, Udzir, Nur Izura
Format: Article
Language: English
Published: Design for Scientific Renaissance 2015
Online Access: http://psasir.upm.edu.my/id/eprint/43513/1/abstract00.pdf
collection UPM
description Skyline queries provide a flexible query operator that returns the data items (skylines) that are not dominated by any other data item across all dimensions (attributes) of the database. Most existing skyline techniques determine the skylines by assuming that the values of all dimensions are available (complete) for every data item. However, this assumption does not always hold, particularly for multidimensional databases, as some values may be missing. Incomplete data causes the dominance relation to lose its transitivity property and causes dominance tests to fail, because some data items are incomparable to each other. Furthermore, incompleteness negatively affects the process of finding skylines, leading to high overhead due to exhaustive pairwise comparisons between the data items. This paper proposes a model for processing skyline queries over incomplete data, with the aim of avoiding the issue of cyclic dominance when deriving skylines. The proposed model consists of four components: Data Clustering Builder, Group Constructor and Local Skylines Identifier, k-dom Skyline Generator, and Incomplete Skylines Identifier. Together, these components optimize the process of identifying skylines in an incomplete database by reducing the number of pairwise comparisons required, eliminating dominated data items as early as possible before the skyline technique is applied.
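The loss of transitivity and the cyclic dominance mentioned in the description can be illustrated with a minimal sketch. It is not the four-component model of the paper; it assumes a commonly used definition of dominance for incomplete data (two items are compared only on the dimensions where both have values), that smaller values are preferred, and that missing values are encoded as None. The names dominates and naive_skyline and the three sample items are hypothetical.

```python
from typing import List, Optional, Sequence

# An item is a tuple of dimension values; None marks a missing value (assumption).
Item = Sequence[Optional[float]]


def dominates(p: Item, q: Item) -> bool:
    """Return True if p dominates q on the dimensions known in both items.

    Assumes smaller is better: p must be no worse on every shared dimension
    and strictly better on at least one. Items with no shared dimension are
    treated as incomparable.
    """
    common = [(a, b) for a, b in zip(p, q) if a is not None and b is not None]
    if not common:
        return False
    return all(a <= b for a, b in common) and any(a < b for a, b in common)


def naive_skyline(items: List[Item]) -> List[Item]:
    """Exhaustive pairwise skyline: keep items that no other item dominates."""
    return [p for p in items if not any(dominates(q, p) for q in items if q is not p)]


# Hypothetical three-dimensional items with one missing value each.
a = (1, None, 10)
b = (3, 2, None)
c = (None, 5, 3)

print(dominates(a, b), dominates(b, c), dominates(c, a))  # a cycle: a > b > c > a
print(dominates(a, c))                                    # transitivity does not hold
print(naive_skyline([a, b, c]))                           # every item is dominated
```

Under these assumptions each item is dominated by another one, so the exhaustive pairwise approach returns no skyline at all, which is the kind of outcome a model for incomplete data has to avoid.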
first_indexed 2024-03-06T08:55:53Z
format Article
id upm.eprints-43513
institution Universiti Putra Malaysia
language English
last_indexed 2024-03-06T08:55:53Z
publishDate 2015
publisher Design for Scientific Renaissance
record_format dspace
spelling upm.eprints-43513 2018-02-23T00:27:41Z http://psasir.upm.edu.my/id/eprint/43513/ Design for Scientific Renaissance 2015-09 Article PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/43513/1/abstract00.pdf Alwan, Ali Amer and Ibrahim, Hamidah and Udzir, Nur Izura (2015) A model for processing skyline queries over a database with missing data. Journal of Advanced Computer Science and Technology Research, 5 (3). pp. 71-82. ISSN 2231-8852 http://www.sign-ific-ance.co.uk/index.php/JACSTR/article/view/1169/1107