Multiple Correspondence Analysis is a Useful Tool to Visualize Complex Categorical Correlated Data

ABSTRACT Objectives We sought to identify the most expensive hospitalized individuals in the Canadian province of Saskatchewan in fiscal year 2012/13, and determine the primary cause of their high use of health services. Our aim was to identify health problems that can be prevented or better mana...

Full description

Bibliographic Details
Main Authors: Jacqueline Quail, Meric Osman, Gary Teare
Format: Article
Language:English
Published: Swansea University 2017-04-01
Series:International Journal of Population Data Science
Online Access:https://ijpds.org/article/view/118
_version_ 1797425599468273664
author Jacqueline Quail
Meric Osman
Gary Teare
author_facet Jacqueline Quail
Meric Osman
Gary Teare
author_sort Jacqueline Quail
collection DOAJ
description ABSTRACT Objectives We sought to identify the most expensive hospitalized individuals in the Canadian province of Saskatchewan in fiscal year 2012/13, and determine the primary cause of their high use of health services. Our aim was to identify health problems that can be prevented or better managed in a non-hospital health care setting. Comorbid conditions are an important and confounding covariate in this population and so we used multiple correspondence analysis (MCA) to investigate the association of these conditions with each other and the most responsible diagnosis for each hospitalization. MCA is a multivariable descriptive statistical technique that displays the relationship between categorical variables in 2-dimensional graphical form. Approach We identified the most expensive 5% of people hospitalized between 01APR2012 and 31MAR2013. Hospital costs accounted for the majority of costs, but physician, drug, long-term care, and home care costs were added. Comorbid conditions in any of the 25 hospital diagnostic fields were identified and grouped into categories based upon ICD-10-CA subcategories. For example, category 1 was ICD-10-CA codes F10-F19: Mental and behavioural disorders due to psychoactive drug use, while category 2 was ICD-10-CA codes F20-F29: Schizophrenia, schizotypal, and delusional disorders. SAS™ v9.3 was used to conduct MCA and generate graphs displaying the correlation between each comorbid condition category, where the distance of each dot from the other represents the strength of the association between the disease categories (i.e., diseases that are correlated cluster together.) The frequency of each category of comorbid condition was represented by the size of the dots on the graph (e.g., the more people with the disease, the larger the dot.) Categories of comorbid conditions were redefined based upon data findings and clinical expertise. Results Three patient groups emerged as being amenable to intervention and thus cost savings, specifically (1) individuals of advanced age who are no longer able to live at home and are hospitalized while waiting for a bed in a long-term care facility, (2) individuals with a mental health and/or addiction problem, and (3) individuals who experienced medical harm during their time in hospital. Conclusion MCA is a valuable graphical tool that is easy to learn and, in conjunction with other statistical techniques, can be used to elucidate the relationship between complex correlated categorical variables.
first_indexed 2024-03-09T08:18:25Z
format Article
id doaj.art-3f80109aad874cb2a075747ecde45a53
institution Directory Open Access Journal
issn 2399-4908
language English
last_indexed 2024-03-09T08:18:25Z
publishDate 2017-04-01
publisher Swansea University
record_format Article
series International Journal of Population Data Science
spelling doaj.art-3f80109aad874cb2a075747ecde45a532023-12-02T21:58:03ZengSwansea UniversityInternational Journal of Population Data Science2399-49082017-04-011110.23889/ijpds.v1i1.118118Multiple Correspondence Analysis is a Useful Tool to Visualize Complex Categorical Correlated DataJacqueline Quail0Meric Osman1Gary Teare2Health Quality Council (Saskatchewan)Health Quality Council (Saskatchewan)Health Quality Council (Saskatchewan)ABSTRACT Objectives We sought to identify the most expensive hospitalized individuals in the Canadian province of Saskatchewan in fiscal year 2012/13, and determine the primary cause of their high use of health services. Our aim was to identify health problems that can be prevented or better managed in a non-hospital health care setting. Comorbid conditions are an important and confounding covariate in this population and so we used multiple correspondence analysis (MCA) to investigate the association of these conditions with each other and the most responsible diagnosis for each hospitalization. MCA is a multivariable descriptive statistical technique that displays the relationship between categorical variables in 2-dimensional graphical form. Approach We identified the most expensive 5% of people hospitalized between 01APR2012 and 31MAR2013. Hospital costs accounted for the majority of costs, but physician, drug, long-term care, and home care costs were added. Comorbid conditions in any of the 25 hospital diagnostic fields were identified and grouped into categories based upon ICD-10-CA subcategories. For example, category 1 was ICD-10-CA codes F10-F19: Mental and behavioural disorders due to psychoactive drug use, while category 2 was ICD-10-CA codes F20-F29: Schizophrenia, schizotypal, and delusional disorders. SAS™ v9.3 was used to conduct MCA and generate graphs displaying the correlation between each comorbid condition category, where the distance of each dot from the other represents the strength of the association between the disease categories (i.e., diseases that are correlated cluster together.) The frequency of each category of comorbid condition was represented by the size of the dots on the graph (e.g., the more people with the disease, the larger the dot.) Categories of comorbid conditions were redefined based upon data findings and clinical expertise. Results Three patient groups emerged as being amenable to intervention and thus cost savings, specifically (1) individuals of advanced age who are no longer able to live at home and are hospitalized while waiting for a bed in a long-term care facility, (2) individuals with a mental health and/or addiction problem, and (3) individuals who experienced medical harm during their time in hospital. Conclusion MCA is a valuable graphical tool that is easy to learn and, in conjunction with other statistical techniques, can be used to elucidate the relationship between complex correlated categorical variables.https://ijpds.org/article/view/118
spellingShingle Jacqueline Quail
Meric Osman
Gary Teare
Multiple Correspondence Analysis is a Useful Tool to Visualize Complex Categorical Correlated Data
International Journal of Population Data Science
title Multiple Correspondence Analysis is a Useful Tool to Visualize Complex Categorical Correlated Data
title_full Multiple Correspondence Analysis is a Useful Tool to Visualize Complex Categorical Correlated Data
title_fullStr Multiple Correspondence Analysis is a Useful Tool to Visualize Complex Categorical Correlated Data
title_full_unstemmed Multiple Correspondence Analysis is a Useful Tool to Visualize Complex Categorical Correlated Data
title_short Multiple Correspondence Analysis is a Useful Tool to Visualize Complex Categorical Correlated Data
title_sort multiple correspondence analysis is a useful tool to visualize complex categorical correlated data
url https://ijpds.org/article/view/118
work_keys_str_mv AT jacquelinequail multiplecorrespondenceanalysisisausefultooltovisualizecomplexcategoricalcorrelateddata
AT mericosman multiplecorrespondenceanalysisisausefultooltovisualizecomplexcategoricalcorrelateddata
AT garyteare multiplecorrespondenceanalysisisausefultooltovisualizecomplexcategoricalcorrelateddata