A study of density-grid based clustering algorithms on data streams

Bibliographic Details
Main Authors: Amini, A., Saybani, M.R., Sahaf Yazdi, S.R.A.
Format: Conference or Workshop Item
Language: English
Published: 2011
Online Access: http://eprints.um.edu.my/13232/1/A_Study_of_Density-Grid.pdf
Description
Summary: Clustering data streams has attracted many researchers as the applications that generate data streams have become more popular. Several distance-based clustering algorithms have been introduced for data streams, but they cannot find clusters of arbitrary shape and cannot handle outliers. Density-based clustering algorithms are notable for their ability both to find arbitrarily shaped clusters and to deal with noise in the data. In density-based clustering algorithms, dense areas of objects in the data space are considered clusters, which are separated by low-density areas. Another group of clustering methods for data streams is grid-based clustering, in which the data space is quantized into a finite number of cells that form a grid structure, and clustering is performed on the grid. Grid-based clustering maps the infinite number of data records in a data stream to a finite number of grid cells. In this paper we review the grid-based clustering algorithms that use density-based algorithms or the concept of density for clustering. We call them density-grid clustering algorithms. We explore the algorithms in detail, along with their merits and limitations. The algorithms are also summarized in a table based on their important features. In addition, we discuss how well the algorithms address the challenging issues in clustering data streams.
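
Illustration: the general density-grid idea described in the summary can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not a reproduction of any algorithm reviewed in the paper. The function names (`cell_of`, `update_grid`, `cluster_dense_cells`), the fixed `cell_size`, the `min_points` density threshold, and the choice to treat diagonal cells as neighbours are all hypothetical. Stream-specific concerns such as fading old data are left out.

```python
from collections import defaultdict, deque
from itertools import product


def cell_of(point, cell_size):
    """Map a point to the integer coordinates of the grid cell containing it."""
    return tuple(int(x // cell_size) for x in point)


def update_grid(counts, stream, cell_size):
    """Consume records one at a time, keeping only a per-cell count.

    The stream may be arbitrarily long; memory grows with the number of
    occupied cells, not with the number of records.
    """
    for point in stream:
        counts[cell_of(point, cell_size)] += 1


def cluster_dense_cells(counts, min_points):
    """Group adjacent dense cells into clusters (assumed rule: count >= min_points).

    Cells below the threshold are treated as noise and receive no label.
    Two cells are adjacent if every coordinate differs by at most 1.
    """
    dense = {cell for cell, n in counts.items() if n >= min_points}
    labels = {}
    next_label = 0
    for start in sorted(dense):
        if start in labels:
            continue
        labels[start] = next_label
        queue = deque([start])
        while queue:  # breadth-first search over neighbouring dense cells
            cell = queue.popleft()
            for offset in product((-1, 0, 1), repeat=len(cell)):
                neighbour = tuple(c + o for c, o in zip(cell, offset))
                if neighbour in dense and neighbour not in labels:
                    labels[neighbour] = next_label
                    queue.append(neighbour)
        next_label += 1
    return labels


# Made-up 2-D records: two groups of points and one isolated record.
stream = [
    (0.1, 0.2), (0.3, 0.4), (0.5, 0.6),
    (1.1, 0.2), (1.3, 0.4), (1.5, 0.6),
    (5.1, 5.2), (5.3, 5.4), (5.5, 5.6),
    (9.0, 0.5),
]
counts = defaultdict(int)
update_grid(counts, stream, cell_size=1.0)
labels = cluster_dense_cells(counts, min_points=3)
```

With these made-up records, the two neighbouring dense cells merge into one cluster, the distant dense cell forms a second cluster, and the isolated record falls in a sparse cell that is left unlabelled as noise.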