Visual Assessment of Cluster Tendency with Variations of Distance Measures

Finding the cluster structure is essential for analyzing self-organized networking structures, such as social networks. In such problems, a wide variety of distance measures can be used. Common clustering methods often require the number of clusters to be explicitly indicated before starting the pro...

Full description

Bibliographic Details
Main Authors: Guzel Shkaberina, Natalia Rezova, Elena Tovbis, Lev Kazakovtsev
Format: Article
Language:English
Published: MDPI AG 2022-12-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/16/1/5
_version_ 1797447139595386880
author Guzel Shkaberina
Natalia Rezova
Elena Tovbis
Lev Kazakovtsev
author_facet Guzel Shkaberina
Natalia Rezova
Elena Tovbis
Lev Kazakovtsev
author_sort Guzel Shkaberina
collection DOAJ
description Finding the cluster structure is essential for analyzing self-organized networking structures, such as social networks. In such problems, a wide variety of distance measures can be used. Common clustering methods often require the number of clusters to be explicitly indicated before starting the process of clustering. A preliminary step to clustering is deciding, firstly, whether the data contain any clusters and, secondly, how many clusters the dataset contains. To highlight the internal structure of data, several methods for visual assessment of clustering tendency (VAT family of methods) have been developed. The vast majority of these methods use the Euclidean distance or cosine similarity measure. In our study, we modified the VAT and iVAT algorithms for visual assessment of the clustering tendency with a wide variety of distance measures. We compared the results of our algorithms obtained from both samples from repositories and data from applied problems.
first_indexed 2024-03-09T13:50:32Z
format Article
id doaj.art-dbca7ead568547e3afc0db2d0efbc224
institution Directory Open Access Journal
issn 1999-4893
language English
last_indexed 2024-03-09T13:50:32Z
publishDate 2022-12-01
publisher MDPI AG
record_format Article
series Algorithms
spelling doaj.art-dbca7ead568547e3afc0db2d0efbc2242023-11-30T20:51:05ZengMDPI AGAlgorithms1999-48932022-12-01161510.3390/a16010005Visual Assessment of Cluster Tendency with Variations of Distance MeasuresGuzel Shkaberina0Natalia Rezova1Elena Tovbis2Lev Kazakovtsev3Institute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, RussiaInstitute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, RussiaInstitute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, RussiaInstitute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, RussiaFinding the cluster structure is essential for analyzing self-organized networking structures, such as social networks. In such problems, a wide variety of distance measures can be used. Common clustering methods often require the number of clusters to be explicitly indicated before starting the process of clustering. A preliminary step to clustering is deciding, firstly, whether the data contain any clusters and, secondly, how many clusters the dataset contains. To highlight the internal structure of data, several methods for visual assessment of clustering tendency (VAT family of methods) have been developed. The vast majority of these methods use the Euclidean distance or cosine similarity measure. In our study, we modified the VAT and iVAT algorithms for visual assessment of the clustering tendency with a wide variety of distance measures. We compared the results of our algorithms obtained from both samples from repositories and data from applied problems.https://www.mdpi.com/1999-4893/16/1/5pre-clustering problemcluster tendencydistance measureVATiVAT
spellingShingle Guzel Shkaberina
Natalia Rezova
Elena Tovbis
Lev Kazakovtsev
Visual Assessment of Cluster Tendency with Variations of Distance Measures
Algorithms
pre-clustering problem
cluster tendency
distance measure
VAT
iVAT
title Visual Assessment of Cluster Tendency with Variations of Distance Measures
title_full Visual Assessment of Cluster Tendency with Variations of Distance Measures
title_fullStr Visual Assessment of Cluster Tendency with Variations of Distance Measures
title_full_unstemmed Visual Assessment of Cluster Tendency with Variations of Distance Measures
title_short Visual Assessment of Cluster Tendency with Variations of Distance Measures
title_sort visual assessment of cluster tendency with variations of distance measures
topic pre-clustering problem
cluster tendency
distance measure
VAT
iVAT
url https://www.mdpi.com/1999-4893/16/1/5
work_keys_str_mv AT guzelshkaberina visualassessmentofclustertendencywithvariationsofdistancemeasures
AT nataliarezova visualassessmentofclustertendencywithvariationsofdistancemeasures
AT elenatovbis visualassessmentofclustertendencywithvariationsofdistancemeasures
AT levkazakovtsev visualassessmentofclustertendencywithvariationsofdistancemeasures