Review of Visualization Methods for Categorical Data in Cluster Analysis

The paper focuses on visualization methods suitable for outcomes of cluster analysis of categorical data (nominal data, specifically). Since nominal data have no inherent order, their graphical representation is often challenging or very limited. This paper aims to provide a list of common visualiza...

Full description

Bibliographic Details
Main Authors: Jana Cibulková, Barbora Kupková
Format: Article
Language:English
Published: Czech Statistical Office 2022-12-01
Series:Statistika: Statistics and Economy Journal
Subjects:
Online Access:https://www.czso.cz/documents/10180/167607761/32019722q4_396-408_cibulkova_analyses.pdf/61bb2c38-69ea-4dc8-a74d-6dcfb582713d?version=1.3
Description
Summary:The paper focuses on visualization methods suitable for outcomes of cluster analysis of categorical data (nominal data, specifically). Since nominal data have no inherent order, their graphical representation is often challenging or very limited. This paper aims to provide a list of common visualization methods in the domain of cluster analysis of objects characterized by nominal variables. Firstly, the various plot types (such as clustering scatter plot, dendrogram, icicle plot) for cluster analysis are presented, and their suitability for presenting clusters of nominal data is discussed. Then, we study approaches of sorting nominal values on chart axes in such a way that would improve visualization of the data. Lastly, we introduce a simple alternative to cluster scatter plot for nominal data, that makes the final visualization of clustering solution more efficient since the pattern and groups in data are now more apparent. The suggested method is demonstrated in illustrative examples.
ISSN:0322-788X
1804-8765