Random effects modelling versus logistic regression for the inclusion of cluster-level covariates in propensity score estimation: a Monte Carlo simulation and registry cohort analysis

<strong>Purpose: </strong>Surgeon and hospital-related features, such as volume, can be associated with treatment choices and outcomes. Accounting for these covariates with propensity score (PS) analysis can be challenging due to the clustered nature of the data. We studied six different...

Повний опис

Бібліографічні деталі
Автори: Du, MD, Prats-Uribe, A, Khalid, S, Prieto-Alhambra, D, Strauss, VY
Формат: Journal article
Мова:English
Опубліковано: Frontiers Media 2023
_version_ 1826311099367030784
author Du, MD
Prats-Uribe, A
Khalid, S
Prieto-Alhambra, D
Strauss, VY
author_facet Du, MD
Prats-Uribe, A
Khalid, S
Prieto-Alhambra, D
Strauss, VY
author_sort Du, MD
collection OXFORD
description <strong>Purpose: </strong>Surgeon and hospital-related features, such as volume, can be associated with treatment choices and outcomes. Accounting for these covariates with propensity score (PS) analysis can be challenging due to the clustered nature of the data. We studied six different PS estimation strategies for clustered data using random effects modelling (REM) compared with logistic regression. <br><strong> Methods: </strong>Monte Carlo simulations were used to generate variable cluster-level confounding intensity [odds ratio (OR) = 1.01–2.5] and cluster size (20–1,000 patients per cluster). The following PS estimation strategies were compared: i) logistic regression omitting cluster-level confounders; ii) logistic regression including cluster-level confounders; iii) the same as ii) but including cross-level interactions; iv), v), and vi), similar to i), ii), and iii), respectively, but using REM instead of logistic regression. The same strategies were tested in a trial emulation of partial versus total knee replacement (TKR) surgery, where observational versus trial-based estimates were compared as a proxy for bias. Performance metrics included bias and mean square error (MSE). <br><strong> Results: </strong>In most simulated scenarios, logistic regression, including cluster-level confounders, led to the lowest bias and MSE, for example, with 50 clusters × 200 individuals and confounding intensity OR = 1.5, a relative bias of 10%, and MSE of 0.003 for (i) compared to 32% and 0.010 for (iv). The results from the trial emulation also gave similar trends. <br><strong> Conclusion: </strong>Logistic regression, including patient and surgeon-/hospital-level confounders, appears to be the preferred strategy for PS estimation.
first_indexed 2024-03-07T08:03:22Z
format Journal article
id oxford-uuid:43b87dc0-eacc-4a8d-832e-93cd0d8784cc
institution University of Oxford
language English
last_indexed 2024-03-07T08:03:22Z
publishDate 2023
publisher Frontiers Media
record_format dspace
spelling oxford-uuid:43b87dc0-eacc-4a8d-832e-93cd0d8784cc2023-10-16T11:53:05ZRandom effects modelling versus logistic regression for the inclusion of cluster-level covariates in propensity score estimation: a Monte Carlo simulation and registry cohort analysisJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:43b87dc0-eacc-4a8d-832e-93cd0d8784ccEnglishSymplectic ElementsFrontiers Media2023Du, MDPrats-Uribe, AKhalid, SPrieto-Alhambra, DStrauss, VY<strong>Purpose: </strong>Surgeon and hospital-related features, such as volume, can be associated with treatment choices and outcomes. Accounting for these covariates with propensity score (PS) analysis can be challenging due to the clustered nature of the data. We studied six different PS estimation strategies for clustered data using random effects modelling (REM) compared with logistic regression. <br><strong> Methods: </strong>Monte Carlo simulations were used to generate variable cluster-level confounding intensity [odds ratio (OR) = 1.01–2.5] and cluster size (20–1,000 patients per cluster). The following PS estimation strategies were compared: i) logistic regression omitting cluster-level confounders; ii) logistic regression including cluster-level confounders; iii) the same as ii) but including cross-level interactions; iv), v), and vi), similar to i), ii), and iii), respectively, but using REM instead of logistic regression. The same strategies were tested in a trial emulation of partial versus total knee replacement (TKR) surgery, where observational versus trial-based estimates were compared as a proxy for bias. Performance metrics included bias and mean square error (MSE). <br><strong> Results: </strong>In most simulated scenarios, logistic regression, including cluster-level confounders, led to the lowest bias and MSE, for example, with 50 clusters × 200 individuals and confounding intensity OR = 1.5, a relative bias of 10%, and MSE of 0.003 for (i) compared to 32% and 0.010 for (iv). The results from the trial emulation also gave similar trends. <br><strong> Conclusion: </strong>Logistic regression, including patient and surgeon-/hospital-level confounders, appears to be the preferred strategy for PS estimation.
spellingShingle Du, MD
Prats-Uribe, A
Khalid, S
Prieto-Alhambra, D
Strauss, VY
Random effects modelling versus logistic regression for the inclusion of cluster-level covariates in propensity score estimation: a Monte Carlo simulation and registry cohort analysis
title Random effects modelling versus logistic regression for the inclusion of cluster-level covariates in propensity score estimation: a Monte Carlo simulation and registry cohort analysis
title_full Random effects modelling versus logistic regression for the inclusion of cluster-level covariates in propensity score estimation: a Monte Carlo simulation and registry cohort analysis
title_fullStr Random effects modelling versus logistic regression for the inclusion of cluster-level covariates in propensity score estimation: a Monte Carlo simulation and registry cohort analysis
title_full_unstemmed Random effects modelling versus logistic regression for the inclusion of cluster-level covariates in propensity score estimation: a Monte Carlo simulation and registry cohort analysis
title_short Random effects modelling versus logistic regression for the inclusion of cluster-level covariates in propensity score estimation: a Monte Carlo simulation and registry cohort analysis
title_sort random effects modelling versus logistic regression for the inclusion of cluster level covariates in propensity score estimation a monte carlo simulation and registry cohort analysis
work_keys_str_mv AT dumd randomeffectsmodellingversuslogisticregressionfortheinclusionofclusterlevelcovariatesinpropensityscoreestimationamontecarlosimulationandregistrycohortanalysis
AT pratsuribea randomeffectsmodellingversuslogisticregressionfortheinclusionofclusterlevelcovariatesinpropensityscoreestimationamontecarlosimulationandregistrycohortanalysis
AT khalids randomeffectsmodellingversuslogisticregressionfortheinclusionofclusterlevelcovariatesinpropensityscoreestimationamontecarlosimulationandregistrycohortanalysis
AT prietoalhambrad randomeffectsmodellingversuslogisticregressionfortheinclusionofclusterlevelcovariatesinpropensityscoreestimationamontecarlosimulationandregistrycohortanalysis
AT straussvy randomeffectsmodellingversuslogisticregressionfortheinclusionofclusterlevelcovariatesinpropensityscoreestimationamontecarlosimulationandregistrycohortanalysis