Visual Assessment of Cluster Tendency with Variations of Distance Measures
Finding the cluster structure is essential for analyzing self-organized networking structures, such as social networks. In such problems, a wide variety of distance measures can be used. Common clustering methods often require the number of clusters to be explicitly indicated before starting the pro...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-12-01
|
Series: | Algorithms |
Subjects: | |
Online Access: | https://www.mdpi.com/1999-4893/16/1/5 |
_version_ | 1797447139595386880 |
---|---|
author | Guzel Shkaberina Natalia Rezova Elena Tovbis Lev Kazakovtsev |
author_facet | Guzel Shkaberina Natalia Rezova Elena Tovbis Lev Kazakovtsev |
author_sort | Guzel Shkaberina |
collection | DOAJ |
description | Finding the cluster structure is essential for analyzing self-organized networking structures, such as social networks. In such problems, a wide variety of distance measures can be used. Common clustering methods often require the number of clusters to be explicitly indicated before starting the process of clustering. A preliminary step to clustering is deciding, firstly, whether the data contain any clusters and, secondly, how many clusters the dataset contains. To highlight the internal structure of data, several methods for visual assessment of clustering tendency (VAT family of methods) have been developed. The vast majority of these methods use the Euclidean distance or cosine similarity measure. In our study, we modified the VAT and iVAT algorithms for visual assessment of the clustering tendency with a wide variety of distance measures. We compared the results of our algorithms obtained from both samples from repositories and data from applied problems. |
first_indexed | 2024-03-09T13:50:32Z |
format | Article |
id | doaj.art-dbca7ead568547e3afc0db2d0efbc224 |
institution | Directory Open Access Journal |
issn | 1999-4893 |
language | English |
last_indexed | 2024-03-09T13:50:32Z |
publishDate | 2022-12-01 |
publisher | MDPI AG |
record_format | Article |
series | Algorithms |
spelling | doaj.art-dbca7ead568547e3afc0db2d0efbc2242023-11-30T20:51:05ZengMDPI AGAlgorithms1999-48932022-12-01161510.3390/a16010005Visual Assessment of Cluster Tendency with Variations of Distance MeasuresGuzel Shkaberina0Natalia Rezova1Elena Tovbis2Lev Kazakovtsev3Institute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, RussiaInstitute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, RussiaInstitute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, RussiaInstitute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, RussiaFinding the cluster structure is essential for analyzing self-organized networking structures, such as social networks. In such problems, a wide variety of distance measures can be used. Common clustering methods often require the number of clusters to be explicitly indicated before starting the process of clustering. A preliminary step to clustering is deciding, firstly, whether the data contain any clusters and, secondly, how many clusters the dataset contains. To highlight the internal structure of data, several methods for visual assessment of clustering tendency (VAT family of methods) have been developed. The vast majority of these methods use the Euclidean distance or cosine similarity measure. In our study, we modified the VAT and iVAT algorithms for visual assessment of the clustering tendency with a wide variety of distance measures. We compared the results of our algorithms obtained from both samples from repositories and data from applied problems.https://www.mdpi.com/1999-4893/16/1/5pre-clustering problemcluster tendencydistance measureVATiVAT |
spellingShingle | Guzel Shkaberina Natalia Rezova Elena Tovbis Lev Kazakovtsev Visual Assessment of Cluster Tendency with Variations of Distance Measures Algorithms pre-clustering problem cluster tendency distance measure VAT iVAT |
title | Visual Assessment of Cluster Tendency with Variations of Distance Measures |
title_full | Visual Assessment of Cluster Tendency with Variations of Distance Measures |
title_fullStr | Visual Assessment of Cluster Tendency with Variations of Distance Measures |
title_full_unstemmed | Visual Assessment of Cluster Tendency with Variations of Distance Measures |
title_short | Visual Assessment of Cluster Tendency with Variations of Distance Measures |
title_sort | visual assessment of cluster tendency with variations of distance measures |
topic | pre-clustering problem cluster tendency distance measure VAT iVAT |
url | https://www.mdpi.com/1999-4893/16/1/5 |
work_keys_str_mv | AT guzelshkaberina visualassessmentofclustertendencywithvariationsofdistancemeasures AT nataliarezova visualassessmentofclustertendencywithvariationsofdistancemeasures AT elenatovbis visualassessmentofclustertendencywithvariationsofdistancemeasures AT levkazakovtsev visualassessmentofclustertendencywithvariationsofdistancemeasures |