Evaluating Cluster-Level Factor Models with <tt>lavaan</tt> and M<i>plus</i>

<i>Background</i>: Researchers frequently use the responses of individuals in clusters to measure cluster-level constructs. Examples are the use of student evaluations to measure teaching quality, or the use of employee ratings of organizational climate. In earlier research, Stapleton an...

Full description

Bibliographic Details
Main Authors: Suzanne Jak, Terrence D. Jorgensen, Yves Rosseel
Format: Article
Language:English
Published: MDPI AG 2021-05-01
Series:Psych
Subjects:
Online Access:https://www.mdpi.com/2624-8611/3/2/12
Description
Summary:<i>Background</i>: Researchers frequently use the responses of individuals in clusters to measure cluster-level constructs. Examples are the use of student evaluations to measure teaching quality, or the use of employee ratings of organizational climate. In earlier research, Stapleton and Johnson (2019) provided advice for measuring cluster-level constructs based on a simulation study with inadvertently confounded design factors. We extended their simulation study using both M<i>plus</i> and lavaan to reveal how their conclusions were dependent on their study conditions. <i>Methods:</i> We generated data sets from the so-called configural model and the simultaneous shared-and-configural model, both with and without nonzero residual variances at the cluster level. We fitted models to these data sets using different maximum likelihood estimation algorithms. <i>Results:</i> Stapleton and Johnson’s results were highly contingent on their confounded design factors. Convergence rates could be very different across algorithms, depending on whether between-level residual variances were zero in the population or in the fitted model. We discovered a worrying convergence issue with the default settings in M<i>plus</i>, resulting in seemingly converged solutions that are actually not. Rejection rates of the normal-theory test statistic were as expected, while rejection rates of the scaled test statistic were seriously inflated in several conditions. <i>Conclusions:</i> The defaults in M<i>plus</i> carry specific risks that are easily checked but not well advertised. Our results also shine a different light on earlier advice on the use of measurement models for shared factors.
ISSN:2624-8611