Monitoring Changes in Clustering Solutions: A Review of Models and Applications
This article comprehensively reviews the applications and algorithms used for monitoring the evolution of clustering solutions in data streams. The clustering technique is an unsupervised learning problem that involves the identification of natural subgroups in a large dataset. In contrast to superv...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2023-01-01
|
Series: | Journal of Probability and Statistics |
Online Access: | http://dx.doi.org/10.1155/2023/7493623 |
_version_ | 1826820716530499584 |
---|---|
author | Muhammad Atif Muhammad Shafiq Muhammad Farooq Gohar Ayub Friedrich Leisch Muhammad Ilyas |
author_facet | Muhammad Atif Muhammad Shafiq Muhammad Farooq Gohar Ayub Friedrich Leisch Muhammad Ilyas |
author_sort | Muhammad Atif |
collection | DOAJ |
description | This article comprehensively reviews the applications and algorithms used for monitoring the evolution of clustering solutions in data streams. The clustering technique is an unsupervised learning problem that involves the identification of natural subgroups in a large dataset. In contrast to supervised learning models, clustering is a data mining technique that retrieves the hidden pattern in the input dataset. The clustering solution reflects the mechanism that leads to a high level of similarity between the items. A few applications include pattern recognition, knowledge discovery, and market segmentation. However, many modern-day applications generate streaming or temporal datasets over time, where the pattern is not stationary and may change over time. In the context of this article, change detection is the process of identifying differences in the cluster solutions obtained from streaming datasets at consecutive time points. In this paper, we briefly review the models/algorithms introduced in the literature to monitor clusters’ evolution in data streams. Monitoring the changes in clustering solutions in streaming datasets plays a vital role in policy-making and future prediction. Of course, it has a wide range of applications that cannot be covered in a single study, but some of the most common are highlighted in this article. |
first_indexed | 2024-03-11T11:16:51Z |
format | Article |
id | doaj.art-cdf45bac3770451b8c596956776ac38c |
institution | Directory Open Access Journal |
issn | 1687-9538 |
language | English |
last_indexed | 2025-02-16T06:34:31Z |
publishDate | 2023-01-01 |
publisher | Wiley |
record_format | Article |
series | Journal of Probability and Statistics |
spelling | doaj.art-cdf45bac3770451b8c596956776ac38c2025-02-03T06:45:16ZengWileyJournal of Probability and Statistics1687-95382023-01-01202310.1155/2023/7493623Monitoring Changes in Clustering Solutions: A Review of Models and ApplicationsMuhammad Atif0Muhammad Shafiq1Muhammad Farooq2Gohar Ayub3Friedrich Leisch4Muhammad Ilyas5Institute of Statistics University of Natural Resources and Life SciencesInstitute of Numerical SciencesDepartment of Statistics University of PeshawarDepartment of Mathematics and StatisticsInstitute of Statistics University of Natural Resources and Life SciencesDepartment of StatisticsThis article comprehensively reviews the applications and algorithms used for monitoring the evolution of clustering solutions in data streams. The clustering technique is an unsupervised learning problem that involves the identification of natural subgroups in a large dataset. In contrast to supervised learning models, clustering is a data mining technique that retrieves the hidden pattern in the input dataset. The clustering solution reflects the mechanism that leads to a high level of similarity between the items. A few applications include pattern recognition, knowledge discovery, and market segmentation. However, many modern-day applications generate streaming or temporal datasets over time, where the pattern is not stationary and may change over time. In the context of this article, change detection is the process of identifying differences in the cluster solutions obtained from streaming datasets at consecutive time points. In this paper, we briefly review the models/algorithms introduced in the literature to monitor clusters’ evolution in data streams. Monitoring the changes in clustering solutions in streaming datasets plays a vital role in policy-making and future prediction. Of course, it has a wide range of applications that cannot be covered in a single study, but some of the most common are highlighted in this article.http://dx.doi.org/10.1155/2023/7493623 |
spellingShingle | Muhammad Atif Muhammad Shafiq Muhammad Farooq Gohar Ayub Friedrich Leisch Muhammad Ilyas Monitoring Changes in Clustering Solutions: A Review of Models and Applications Journal of Probability and Statistics |
title | Monitoring Changes in Clustering Solutions: A Review of Models and Applications |
title_full | Monitoring Changes in Clustering Solutions: A Review of Models and Applications |
title_fullStr | Monitoring Changes in Clustering Solutions: A Review of Models and Applications |
title_full_unstemmed | Monitoring Changes in Clustering Solutions: A Review of Models and Applications |
title_short | Monitoring Changes in Clustering Solutions: A Review of Models and Applications |
title_sort | monitoring changes in clustering solutions a review of models and applications |
url | http://dx.doi.org/10.1155/2023/7493623 |
work_keys_str_mv | AT muhammadatif monitoringchangesinclusteringsolutionsareviewofmodelsandapplications AT muhammadshafiq monitoringchangesinclusteringsolutionsareviewofmodelsandapplications AT muhammadfarooq monitoringchangesinclusteringsolutionsareviewofmodelsandapplications AT goharayub monitoringchangesinclusteringsolutionsareviewofmodelsandapplications AT friedrichleisch monitoringchangesinclusteringsolutionsareviewofmodelsandapplications AT muhammadilyas monitoringchangesinclusteringsolutionsareviewofmodelsandapplications |