Two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clustering

Outliers are data points that significantly deviate from other data points in a data set because of different mechanisms or unusual processes. Outlier detection is one of the intensively studied research topics for identification of novelties, frauds, anomalies, deviations or exceptions in addition...

Full description

Bibliographic Details
Main Authors: Zeynel Cebeci, Cagatay Cebeci, Yalcin Tahtali, Lutfi Bayyurt
Format: Article
Language:English
Published: PeerJ Inc. 2022-09-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-1060.pdf
_version_ 1818029044054097920
author Zeynel Cebeci
Cagatay Cebeci
Yalcin Tahtali
Lutfi Bayyurt
author_facet Zeynel Cebeci
Cagatay Cebeci
Yalcin Tahtali
Lutfi Bayyurt
author_sort Zeynel Cebeci
collection DOAJ
description Outliers are data points that significantly deviate from other data points in a data set because of different mechanisms or unusual processes. Outlier detection is one of the intensively studied research topics for identification of novelties, frauds, anomalies, deviations or exceptions in addition to its use for data cleansing in data science. In this study, we propose two novel outlier detection approaches using the typicality degrees which are the partitioning result of unsupervised possibilistic clustering algorithms. The proposed approaches are based on finding the atypical data points below a predefined threshold value, a possibilistic level for evaluating a point as an outlier. The experiments on the synthetic and real data sets showed that the proposed approaches can be successfully used to detect outliers without considering the structure and distribution of the features in multidimensional data sets.
first_indexed 2024-12-10T05:13:25Z
format Article
id doaj.art-3e0ba5409b764335bdb460074d391912
institution Directory Open Access Journal
issn 2376-5992
language English
last_indexed 2024-12-10T05:13:25Z
publishDate 2022-09-01
publisher PeerJ Inc.
record_format Article
series PeerJ Computer Science
spelling doaj.art-3e0ba5409b764335bdb460074d3919122022-12-22T02:01:02ZengPeerJ Inc.PeerJ Computer Science2376-59922022-09-018e106010.7717/peerj-cs.1060Two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clusteringZeynel Cebeci0Cagatay Cebeci1Yalcin Tahtali2Lutfi Bayyurt3Department of Animal Science, Faculty of Agriculture, Cukurova University, Adana, TurkeyDepartment of Electronics & Electrical Engineering, University of Strathclyde, Glasgow, United KingdomDepartment of Agriculture, Faculty of Agriculture, Tokat Gaziosmanpasa University, Tokat, TurkeyDepartment of Agriculture, Faculty of Agriculture, Tokat Gaziosmanpasa University, Tokat, TurkeyOutliers are data points that significantly deviate from other data points in a data set because of different mechanisms or unusual processes. Outlier detection is one of the intensively studied research topics for identification of novelties, frauds, anomalies, deviations or exceptions in addition to its use for data cleansing in data science. In this study, we propose two novel outlier detection approaches using the typicality degrees which are the partitioning result of unsupervised possibilistic clustering algorithms. The proposed approaches are based on finding the atypical data points below a predefined threshold value, a possibilistic level for evaluating a point as an outlier. The experiments on the synthetic and real data sets showed that the proposed approaches can be successfully used to detect outliers without considering the structure and distribution of the features in multidimensional data sets.https://peerj.com/articles/cs-1060.pdfOutlier detectionAnomaly detectionUnsupervised learningFuzzy and possibilistic clusteringData analysis
spellingShingle Zeynel Cebeci
Cagatay Cebeci
Yalcin Tahtali
Lutfi Bayyurt
Two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clustering
PeerJ Computer Science
Outlier detection
Anomaly detection
Unsupervised learning
Fuzzy and possibilistic clustering
Data analysis
title Two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clustering
title_full Two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clustering
title_fullStr Two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clustering
title_full_unstemmed Two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clustering
title_short Two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clustering
title_sort two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clustering
topic Outlier detection
Anomaly detection
Unsupervised learning
Fuzzy and possibilistic clustering
Data analysis
url https://peerj.com/articles/cs-1060.pdf
work_keys_str_mv AT zeynelcebeci twonoveloutlierdetectionapproachesbasedonunsupervisedpossibilisticandfuzzyclustering
AT cagataycebeci twonoveloutlierdetectionapproachesbasedonunsupervisedpossibilisticandfuzzyclustering
AT yalcintahtali twonoveloutlierdetectionapproachesbasedonunsupervisedpossibilisticandfuzzyclustering
AT lutfibayyurt twonoveloutlierdetectionapproachesbasedonunsupervisedpossibilisticandfuzzyclustering