A model for processing skyline queries over a database with missing data
Skyline queries provide a flexible query operator that returns data items (skylines) that are not dominated by other data items in all dimensions (attributes) of the database. Most existing skyline techniques determine the skylines by assuming that the values of dimensions for every da...
Main Authors: | Alwan, Ali Amer; Ibrahim, Hamidah; Udzir, Nur Izura |
---|---|
Format: | Article |
Language: | English |
Published: | Design for Scientific Renaissance, 2015 |
Online Access: | http://psasir.upm.edu.my/id/eprint/43513/1/abstract00.pdf |
_version_ | 1796974358379364352 |
---|---|
author | Alwan, Ali Amer; Ibrahim, Hamidah; Udzir, Nur Izura |
collection | UPM |
description | Skyline queries provide a flexible query operator that returns data items (skylines) that are not dominated by other data items in all dimensions (attributes) of the database. Most existing skyline techniques determine the skylines by assuming that the values of all dimensions are available (complete) for every data item. However, this assumption does not always hold, particularly for multidimensional databases, where some values may be missing. Incomplete data breaks the transitivity property that skyline techniques rely on and causes dominance tests to fail, because some data items are incomparable to each other. Incompleteness also adds high overhead to the process of finding skylines, owing to exhaustive pairwise comparisons between data items. This paper proposes a model for processing skyline queries over incomplete data, with the aim of avoiding cyclic dominance when deriving skylines. The proposed model consists of four components: Data Clustering Builder, Group Constructor and Local Skylines Identifier, k-dom Skyline Generator, and Incomplete Skylines Identifier. Together, these components optimize the identification of skylines in an incomplete database by eliminating dominated data items as early as possible, before the skyline technique is applied, which reduces the number of pairwise comparisons required. |
first_indexed | 2024-03-06T08:55:53Z |
format | Article |
id | upm.eprints-43513 |
institution | Universiti Putra Malaysia |
language | English |
last_indexed | 2024-03-06T08:55:53Z |
publishDate | 2015 |
publisher | Design for Scientific Renaissance |
record_format | dspace |
spelling | upm.eprints-43513 2018-02-23T00:27:41Z http://psasir.upm.edu.my/id/eprint/43513/ A model for processing skyline queries over a database with missing data Alwan, Ali Amer Ibrahim, Hamidah Udzir, Nur Izura Design for Scientific Renaissance 2015-09 Article PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/43513/1/abstract00.pdf Alwan, Ali Amer and Ibrahim, Hamidah and Udzir, Nur Izura (2015) A model for processing skyline queries over a database with missing data. Journal of Advanced Computer Science and Technology Research, 5 (3). pp. 71-82. ISSN 2231-8852 http://www.sign-ific-ance.co.uk/index.php/JACSTR/article/view/1169/1107 |
title | A model for processing skyline queries over a database with missing data |
url | http://psasir.upm.edu.my/id/eprint/43513/1/abstract00.pdf |
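
The description in this record states that missing values break the transitivity of dominance and can lead to cyclic dominance. The following is a minimal Python sketch of that effect, not the paper's four-component model. It assumes a common convention that the record does not spell out: two items are compared only on the dimensions where both have a value, smaller values are preferred, and `None` marks a missing value. The names `dominates`, `naive_skyline`, and the sample items `a`, `b`, `c` are hypothetical.

```python
from typing import List, Optional, Sequence

# An item is a sequence of values; None marks a missing dimension (assumption).
Item = Sequence[Optional[float]]


def dominates(p: Item, q: Item) -> bool:
    """Return True if p dominates q on the dimensions both items have.

    Assumed convention: smaller is better. p dominates q when p is no worse
    than q on every shared dimension and strictly better on at least one.
    """
    strictly_better = False
    for pv, qv in zip(p, q):
        if pv is None or qv is None:
            continue  # dimension is missing in one item, so it is not compared
        if pv > qv:
            return False  # p is worse on a shared dimension
        if pv < qv:
            strictly_better = True
    return strictly_better


def naive_skyline(items: List[Item]) -> List[Item]:
    """Exhaustive pairwise skyline: keep items that no other item dominates."""
    return [p for p in items
            if not any(dominates(q, p) for q in items if q is not p)]


# Three hypothetical items, each missing a different dimension.
a = (1, None, 10)
b = (3, 2, None)
c = (None, 5, 3)

# a dominates b, b dominates c, and c dominates a: a dominance cycle.
cycle = dominates(a, b) and dominates(b, c) and dominates(c, a)
skyline = naive_skyline([a, b, c])
```

In this example `cycle` is `True` and `skyline` is empty: `a` dominates `b` and `b` dominates `c`, yet `a` does not dominate `c`, so every item is eliminated by some other item. This is the kind of outcome that a model for incomplete data has to avoid, and the exhaustive comparison in `naive_skyline` is the pairwise cost that the description says the proposed model aims to reduce.
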