Risky business: factor analysis of survey data - assessing the probability of incorrect dimensionalisation.

This paper undertakes a systematic assessment of the extent to which factor analysis the correct number of latent dimensions (factors) when applied to ordered-categorical survey items (so-called Likert items). We simulate 2400 data sets of uni-dimensional Likert items that vary systematically over a...

Full description

Bibliographic Details
Main Authors: Cees van der Eijk, Jonathan Rose
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0118900
_version_ 1818682342764445696
author Cees van der Eijk
Jonathan Rose
author_facet Cees van der Eijk
Jonathan Rose
author_sort Cees van der Eijk
collection DOAJ
description This paper undertakes a systematic assessment of the extent to which factor analysis the correct number of latent dimensions (factors) when applied to ordered-categorical survey items (so-called Likert items). We simulate 2400 data sets of uni-dimensional Likert items that vary systematically over a range of conditions such as the underlying population distribution, the number of items, the level of random error, and characteristics of items and item-sets. Each of these datasets is factor analysed in a variety of ways that are frequently used in the extant literature, or that are recommended in current methodological texts. These include exploratory factor retention heuristics such as Kaiser's criterion, Parallel Analysis and a non-graphical scree test, and (for exploratory and confirmatory analyses) evaluations of model fit. These analyses are conducted on the basis of Pearson and polychoric correlations. We find that, irrespective of the particular mode of analysis, factor analysis applied to ordered-categorical survey data very often leads to over-dimensionalisation. The magnitude of this risk depends on the specific way in which factor analysis is conducted, the number of items, the properties of the set of items, and the underlying population distribution. The paper concludes with a discussion of the consequences of over-dimensionalisation, and a brief mention of alternative modes of analysis that are much less prone to such problems.
first_indexed 2024-12-17T10:17:19Z
format Article
id doaj.art-89a37505c0144e38af615075d80f5760
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-17T10:17:19Z
publishDate 2015-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-89a37505c0144e38af615075d80f57602022-12-21T21:52:52ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-01103e011890010.1371/journal.pone.0118900Risky business: factor analysis of survey data - assessing the probability of incorrect dimensionalisation.Cees van der EijkJonathan RoseThis paper undertakes a systematic assessment of the extent to which factor analysis the correct number of latent dimensions (factors) when applied to ordered-categorical survey items (so-called Likert items). We simulate 2400 data sets of uni-dimensional Likert items that vary systematically over a range of conditions such as the underlying population distribution, the number of items, the level of random error, and characteristics of items and item-sets. Each of these datasets is factor analysed in a variety of ways that are frequently used in the extant literature, or that are recommended in current methodological texts. These include exploratory factor retention heuristics such as Kaiser's criterion, Parallel Analysis and a non-graphical scree test, and (for exploratory and confirmatory analyses) evaluations of model fit. These analyses are conducted on the basis of Pearson and polychoric correlations. We find that, irrespective of the particular mode of analysis, factor analysis applied to ordered-categorical survey data very often leads to over-dimensionalisation. The magnitude of this risk depends on the specific way in which factor analysis is conducted, the number of items, the properties of the set of items, and the underlying population distribution. The paper concludes with a discussion of the consequences of over-dimensionalisation, and a brief mention of alternative modes of analysis that are much less prone to such problems.https://doi.org/10.1371/journal.pone.0118900
spellingShingle Cees van der Eijk
Jonathan Rose
Risky business: factor analysis of survey data - assessing the probability of incorrect dimensionalisation.
PLoS ONE
title Risky business: factor analysis of survey data - assessing the probability of incorrect dimensionalisation.
title_full Risky business: factor analysis of survey data - assessing the probability of incorrect dimensionalisation.
title_fullStr Risky business: factor analysis of survey data - assessing the probability of incorrect dimensionalisation.
title_full_unstemmed Risky business: factor analysis of survey data - assessing the probability of incorrect dimensionalisation.
title_short Risky business: factor analysis of survey data - assessing the probability of incorrect dimensionalisation.
title_sort risky business factor analysis of survey data assessing the probability of incorrect dimensionalisation
url https://doi.org/10.1371/journal.pone.0118900
work_keys_str_mv AT ceesvandereijk riskybusinessfactoranalysisofsurveydataassessingtheprobabilityofincorrectdimensionalisation
AT jonathanrose riskybusinessfactoranalysisofsurveydataassessingtheprobabilityofincorrectdimensionalisation