A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big Data

One of the most common tasks nowadays in big data environments is the need to classify large amounts of data. There are numerous classification models designed to perform best in different environments and datasets, each with its advantages and disadvantages. However, when dealing with big data, the...

Full description

Bibliographic Details
Main Authors: Christos Kalyvas, Manolis Maragoudakis
Format: Article
Language:English
Published: MDPI AG 2020-09-01
Series:Computation
Subjects:
Online Access:https://www.mdpi.com/2079-3197/8/3/80
_version_ 1827706502129909760
author Christos Kalyvas
Manolis Maragoudakis
author_facet Christos Kalyvas
Manolis Maragoudakis
author_sort Christos Kalyvas
collection DOAJ
description One of the most common tasks nowadays in big data environments is the need to classify large amounts of data. There are numerous classification models designed to perform best in different environments and datasets, each with its advantages and disadvantages. However, when dealing with big data, their performance is significantly degraded because they are not designed—or even capable—of handling very large datasets. The current approach is based on a novel proposal of exploiting the dynamics of skyline queries to efficiently identify the decision boundary and classify big data. A comparison against the popular k-nearest neighbor (k-NN), support vector machines (SVM) and naïve Bayes classification algorithms shows that the proposed method is faster than the k-NN and the SVM. The novelty of this method is based on the fact that only a small number of computations are needed in order to make a prediction, while its full potential is revealed in very large datasets.
first_indexed 2024-03-10T16:26:13Z
format Article
id doaj.art-2eb63f36ecea44958dcea251dbe65306
institution Directory Open Access Journal
issn 2079-3197
language English
last_indexed 2024-03-10T16:26:13Z
publishDate 2020-09-01
publisher MDPI AG
record_format Article
series Computation
spelling doaj.art-2eb63f36ecea44958dcea251dbe653062023-11-20T13:10:17ZengMDPI AGComputation2079-31972020-09-01838010.3390/computation8030080A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big DataChristos Kalyvas0Manolis Maragoudakis1Department of Information and Communication Systems Engineering, University of the Aegean, 83200 Samos, GreeceDepartment of Informatics, Ionian University, 49100 Corfu, GreeceOne of the most common tasks nowadays in big data environments is the need to classify large amounts of data. There are numerous classification models designed to perform best in different environments and datasets, each with its advantages and disadvantages. However, when dealing with big data, their performance is significantly degraded because they are not designed—or even capable—of handling very large datasets. The current approach is based on a novel proposal of exploiting the dynamics of skyline queries to efficiently identify the decision boundary and classify big data. A comparison against the popular k-nearest neighbor (k-NN), support vector machines (SVM) and naïve Bayes classification algorithms shows that the proposed method is faster than the k-NN and the SVM. The novelty of this method is based on the fact that only a small number of computations are needed in order to make a prediction, while its full potential is revealed in very large datasets.https://www.mdpi.com/2079-3197/8/3/80classificationskylinebig datadecision boundary
spellingShingle Christos Kalyvas
Manolis Maragoudakis
A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big Data
Computation
classification
skyline
big data
decision boundary
title A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big Data
title_full A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big Data
title_fullStr A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big Data
title_full_unstemmed A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big Data
title_short A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big Data
title_sort skyline based decision boundary estimation method for binominal classification in big data
topic classification
skyline
big data
decision boundary
url https://www.mdpi.com/2079-3197/8/3/80
work_keys_str_mv AT christoskalyvas askylinebaseddecisionboundaryestimationmethodforbinominalclassificationinbigdata
AT manolismaragoudakis askylinebaseddecisionboundaryestimationmethodforbinominalclassificationinbigdata
AT christoskalyvas skylinebaseddecisionboundaryestimationmethodforbinominalclassificationinbigdata
AT manolismaragoudakis skylinebaseddecisionboundaryestimationmethodforbinominalclassificationinbigdata