Active clustering data streams with affinity propagation

Most existing applications have a large number of evolving data streams. Clustering data streams is still a critical problem for these applications as the data are evolving and changes over time. Most existing algorithms are unsupervised learning in which background information is useless. This pape...

Full description

Bibliographic Details
Main Authors: Sameh Abdulah, Ph.D., Walid Atwa, Ph.D., Ahmed M. Abdelmoniem, Ph.D.
Format: Article
Language:English
Published: Elsevier 2022-06-01
Series:ICT Express
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2405959521001077
Description
Summary:Most existing applications have a large number of evolving data streams. Clustering data streams is still a critical problem for these applications as the data are evolving and changes over time. Most existing algorithms are unsupervised learning in which background information is useless. This paper proposes an active clustering algorithm for data stream based on the affinity propagation method, referred to as AAPStream. The affinity propagation aims to identify exemplars and create clusters based on these exemplars. Thus, the objective is to get the most informative exemplars to create the streaming model and predict the new arrival data. We conduct a set of experiments on real-world datasets to compare our algorithm with a state-of-the-art algorithm, and the experimental results show the effectiveness of the proposed algorithm.
ISSN:2405-9595