Validity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficient

The literature presents many methods for partitioning of data set, and is difficult choose which is the most suitable, since the various combinations of methods based on different measures of dissimilarity can lead to different patterns of grouping and false interpretations. Nevertheless, little ef...

Full description

Bibliographic Details
Main Authors: Priscilla Ramos Carvalho, Casimiro Sepúlveda Munita, André Luiz Lapolli
Format: Article
Language:English
Published: Brazilian Radiation Protection Society (Sociedade Brasileira de Proteção Radiológica, SBPR) 2019-02-01
Series:Brazilian Journal of Radiation Sciences
Subjects:
Online Access:https://bjrs.org.br/revista/index.php/REVISTA/article/view/668
Description
Summary:The literature presents many methods for partitioning of data set, and is difficult choose which is the most suitable, since the various combinations of methods based on different measures of dissimilarity can lead to different patterns of grouping and false interpretations. Nevertheless, little effort has been expended in evaluating these methods empirically using an archaeological data set. In this way, the objective of this work is make a comparative study of the different cluster analysis methods and identify which is the most appropriate. For this, the study was carried out using a data set of 45 samples of ceramic fragments, analyzed by instrumental neutron activation analysis (INAA). The methods used for this study were: Single linkage, Complete linkage, Average linkage, Centroid and Ward. The validation was done using the cophenetic correlation coefficient and comparing these values the average linkage method obtained better results. A script of the statistical program R with some functions was created to obtain the cophenetic correlation. By means of these values was possible to choose the most appropriate method to be used in the data set.
ISSN:2319-0612