Robust local triangular kernel density-based clustering for high-dimensional data

A number of clustering algorithms can be employed to find clusters in multivariate data. However, the effectiveness and efficiency of existing algorithms are limited, since the data are high-dimensional, contain a large amount of noise, and consist of clusters with arbitrary shapes and den...


Bibliographic Details
Main Authors: Musdholifah, Aina, Mohd Hashim, Siti Zaiton
Format: Conference or Workshop Item
Published: 2013
Subjects: QA75 Electronic computers. Computer science
author Musdholifah, Aina
Mohd Hashim, Siti Zaiton
collection ePrints
description A number of clustering algorithms can be employed to find clusters in multivariate data. However, the effectiveness and efficiency of existing algorithms are limited, since the data are high-dimensional, contain a large amount of noise, and consist of clusters with arbitrary shapes and densities. In this paper, a new kernel density-based clustering algorithm, called Local Triangular Kernel-based Clustering (LTKC), is proposed to deal with these conditions. LTKC is based on a combination of k-nearest-neighbor density estimation and triangular kernel density-based clustering. The advantages of the LTKC approach are: (1) it has a firm mathematical basis; (2) it requires only one parameter, the number of neighbors; (3) it determines the number of clusters automatically; (4) it allows the discovery of clusters with arbitrary shapes and densities; and (5) it is significantly faster than existing algorithms. LTKC is tested on artificial data and applied to several UCI data sets. A comparison with k-means, KFCM, and well-known density-based clustering algorithms, including ILGC, DBSCAN, and DENCLUE, shows the superiority of the proposed LTKC algorithm.
first_indexed 2024-03-05T19:29:45Z
format Conference or Workshop Item
id utm.eprints-51289
institution Universiti Teknologi Malaysia - ePrints
last_indexed 2024-03-05T19:29:45Z
publishDate 2013
record_format dspace
spelling utm.eprints-51289 2017-07-18T07:46:37Z http://eprints.utm.my/51289/ Robust local triangular kernel density-based clustering for high-dimensional data Musdholifah, Aina Mohd Hashim, Siti Zaiton QA75 Electronic computers. Computer science 2013 Conference or Workshop Item PeerReviewed Musdholifah, Aina and Mohd Hashim, Siti Zaiton (2013) Robust local triangular kernel density-based clustering for high-dimensional data. In: 2013 5th International Conference on Computer Science and Information Technology, CSIT 2013 - Proceedings, MAR 27-28, 2013, Amman, Jordan. http://apps.webofknowledge.com.ezproxy.utm.my/full_record.do?product=WOS&search_mode=GeneralSearch&qid=11&SID=R2Cjh3fA6kIeWhVr585&page=1&doc=1
title Robust local triangular kernel density-based clustering for high-dimensional data
topic QA75 Electronic computers. Computer science