Monitoring Changes in Clustering Solutions: A Review of Models and Applications

This article comprehensively reviews the applications and algorithms used for monitoring the evolution of clustering solutions in data streams. The clustering technique is an unsupervised learning problem that involves the identification of natural subgroups in a large dataset. In contrast to superv...

Full description

Bibliographic Details
Main Authors: Muhammad Atif, Muhammad Shafiq, Muhammad Farooq, Gohar Ayub, Friedrich Leisch, Muhammad Ilyas
Format: Article
Language:English
Published: Wiley 2023-01-01
Series:Journal of Probability and Statistics
Online Access:http://dx.doi.org/10.1155/2023/7493623
_version_ 1826820716530499584
author Muhammad Atif
Muhammad Shafiq
Muhammad Farooq
Gohar Ayub
Friedrich Leisch
Muhammad Ilyas
author_facet Muhammad Atif
Muhammad Shafiq
Muhammad Farooq
Gohar Ayub
Friedrich Leisch
Muhammad Ilyas
author_sort Muhammad Atif
collection DOAJ
description This article comprehensively reviews the applications and algorithms used for monitoring the evolution of clustering solutions in data streams. The clustering technique is an unsupervised learning problem that involves the identification of natural subgroups in a large dataset. In contrast to supervised learning models, clustering is a data mining technique that retrieves the hidden pattern in the input dataset. The clustering solution reflects the mechanism that leads to a high level of similarity between the items. A few applications include pattern recognition, knowledge discovery, and market segmentation. However, many modern-day applications generate streaming or temporal datasets over time, where the pattern is not stationary and may change over time. In the context of this article, change detection is the process of identifying differences in the cluster solutions obtained from streaming datasets at consecutive time points. In this paper, we briefly review the models/algorithms introduced in the literature to monitor clusters’ evolution in data streams. Monitoring the changes in clustering solutions in streaming datasets plays a vital role in policy-making and future prediction. Of course, it has a wide range of applications that cannot be covered in a single study, but some of the most common are highlighted in this article.
first_indexed 2024-03-11T11:16:51Z
format Article
id doaj.art-cdf45bac3770451b8c596956776ac38c
institution Directory Open Access Journal
issn 1687-9538
language English
last_indexed 2025-02-16T06:34:31Z
publishDate 2023-01-01
publisher Wiley
record_format Article
series Journal of Probability and Statistics
spelling doaj.art-cdf45bac3770451b8c596956776ac38c2025-02-03T06:45:16ZengWileyJournal of Probability and Statistics1687-95382023-01-01202310.1155/2023/7493623Monitoring Changes in Clustering Solutions: A Review of Models and ApplicationsMuhammad Atif0Muhammad Shafiq1Muhammad Farooq2Gohar Ayub3Friedrich Leisch4Muhammad Ilyas5Institute of Statistics University of Natural Resources and Life SciencesInstitute of Numerical SciencesDepartment of Statistics University of PeshawarDepartment of Mathematics and StatisticsInstitute of Statistics University of Natural Resources and Life SciencesDepartment of StatisticsThis article comprehensively reviews the applications and algorithms used for monitoring the evolution of clustering solutions in data streams. The clustering technique is an unsupervised learning problem that involves the identification of natural subgroups in a large dataset. In contrast to supervised learning models, clustering is a data mining technique that retrieves the hidden pattern in the input dataset. The clustering solution reflects the mechanism that leads to a high level of similarity between the items. A few applications include pattern recognition, knowledge discovery, and market segmentation. However, many modern-day applications generate streaming or temporal datasets over time, where the pattern is not stationary and may change over time. In the context of this article, change detection is the process of identifying differences in the cluster solutions obtained from streaming datasets at consecutive time points. In this paper, we briefly review the models/algorithms introduced in the literature to monitor clusters’ evolution in data streams. Monitoring the changes in clustering solutions in streaming datasets plays a vital role in policy-making and future prediction. Of course, it has a wide range of applications that cannot be covered in a single study, but some of the most common are highlighted in this article.http://dx.doi.org/10.1155/2023/7493623
spellingShingle Muhammad Atif
Muhammad Shafiq
Muhammad Farooq
Gohar Ayub
Friedrich Leisch
Muhammad Ilyas
Monitoring Changes in Clustering Solutions: A Review of Models and Applications
Journal of Probability and Statistics
title Monitoring Changes in Clustering Solutions: A Review of Models and Applications
title_full Monitoring Changes in Clustering Solutions: A Review of Models and Applications
title_fullStr Monitoring Changes in Clustering Solutions: A Review of Models and Applications
title_full_unstemmed Monitoring Changes in Clustering Solutions: A Review of Models and Applications
title_short Monitoring Changes in Clustering Solutions: A Review of Models and Applications
title_sort monitoring changes in clustering solutions a review of models and applications
url http://dx.doi.org/10.1155/2023/7493623
work_keys_str_mv AT muhammadatif monitoringchangesinclusteringsolutionsareviewofmodelsandapplications
AT muhammadshafiq monitoringchangesinclusteringsolutionsareviewofmodelsandapplications
AT muhammadfarooq monitoringchangesinclusteringsolutionsareviewofmodelsandapplications
AT goharayub monitoringchangesinclusteringsolutionsareviewofmodelsandapplications
AT friedrichleisch monitoringchangesinclusteringsolutionsareviewofmodelsandapplications
AT muhammadilyas monitoringchangesinclusteringsolutionsareviewofmodelsandapplications