A Monte Carlo sample size study: how many countries are needed for accurate multilevel SEM?

Bibliographic Details
Main Authors: Bart Meuleman, Jaak Billiet
Format: Article
Language: English
Published: European Survey Research Association 2009-03-01
Series: Survey Research Methods
Online Access: https://ojs.ub.uni-konstanz.de/srm/article/view/666
Description
Summary: Recently, there has been growing scientific interest in cross-national survey research. Various scholars have used multilevel techniques to link individual characteristics to aspects of the national context. At first sight, multilevel SEM seems to be a promising tool for this purpose, as it integrates multilevel modeling within a latent variable framework. However, because the number of countries in most international surveys does not exceed 30, the application of multilevel SEM in cross-national research is problematic. Taking European Social Survey (ESS) data as a point of departure, this paper uses Monte Carlo studies to assess the estimation accuracy of multilevel SEM with small group sample sizes. The results indicate that a group sample size of 20, a situation common in cross-national research, does not guarantee accurate estimation at all. Unacceptable amounts of parameter and standard error bias are present for the between-level estimates. Unless the standardized effect is very large (0.75), statistical power for detecting a significant between-level structural effect is seriously lacking. Required group sample sizes depend strongly on the specific interests of the researcher, the expected effect sizes, and the complexity of the model. If the between-level model is relatively simple and one is merely interested in the between-level factor structure, a group sample size of 40 could be sufficient. To detect large (>0.50) structural effects at the between level, at least 60 groups are required. To have an acceptable probability of detecting smaller effects, more than 100 groups are needed. These guidelines are shown to be quite robust to varying cluster sizes and intra-class correlations (ICCs).
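Illustration: the Monte Carlo logic behind such power statements can be sketched in a few lines of Python. This is a deliberately simplified stand-in, not the authors' procedure. It assumes a single standardized effect between two error-free group-level variables and tests it with a Fisher z approximation, whereas the paper estimates a full multilevel SEM. The function name `between_level_power` and all of its parameters are hypothetical, and the resulting figures should not be expected to match the paper's.

```python
import numpy as np


def between_level_power(n_groups, beta, n_reps=2000, seed=0):
    """Share of replications in which a standardized group-level effect
    `beta` is detected at the 5% level (two-sided).

    Simplifying assumption: group-level scores are observed without error,
    so individual-level sampling, cluster size, and ICC play no role here.
    """
    rng = np.random.default_rng(seed)
    # One row per replication, one column per group (e.g. country).
    x = rng.standard_normal((n_reps, n_groups))
    e = rng.standard_normal((n_reps, n_groups))
    # Outcome with unit variance and true correlation `beta` with x.
    y = beta * x + np.sqrt(1.0 - beta**2) * e
    # Sample correlation within each replication.
    xc = x - x.mean(axis=1, keepdims=True)
    yc = y - y.mean(axis=1, keepdims=True)
    r = (xc * yc).sum(axis=1) / np.sqrt((xc**2).sum(axis=1) * (yc**2).sum(axis=1))
    # Fisher z test with a normal approximation (an assumption of this sketch).
    z = np.arctanh(r) * np.sqrt(n_groups - 3)
    return float(np.mean(np.abs(z) > 1.96))
```

Calling `between_level_power(g, b)` for `g` in 20, 40, 60, 100 and `b` in 0.25, 0.50, 0.75 shows how the rejection rate rises with the number of groups and with effect size. Because measurement error and model complexity are ignored, this sketch will tend to be more optimistic than a multilevel SEM simulation.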
ISSN: 1864-3361