On the aggregation of published prognostic scores for causal inference in observational studies

As real world evidence on drug efficacy involves nonrandomized studies, statistical methods adjusting for confounding are needed. In this context, prognostic score (PGS) analysis has recently been proposed as a method for causal inference. It aims to restore balance across the different treatment gr...

Full description

Bibliographic Details
Main Authors: Nguyen, T-L, Collins, GS, Pellegrini, F, Moons, KGM, Debray, TPA
Format: Journal article
Language:English
Published: Wiley 2020
_version_ 1797057192528969728
author Nguyen, T-L
Collins, GS
Pellegrini, F
Moons, KGM
Debray, TPA
author_facet Nguyen, T-L
Collins, GS
Pellegrini, F
Moons, KGM
Debray, TPA
author_sort Nguyen, T-L
collection OXFORD
description As real world evidence on drug efficacy involves nonrandomized studies, statistical methods adjusting for confounding are needed. In this context, prognostic score (PGS) analysis has recently been proposed as a method for causal inference. It aims to restore balance across the different treatment groups by identifying subjects with a similar prognosis for a given reference exposure ("control"). This requires the development of a multivariable prognostic model in the control arm of the study sample, which is then extrapolated to the different treatment arms. Unfortunately, large cohorts for developing prognostic models are not always available. Prognostic models are therefore subject to a dilemma between overfitting and parsimony; the latter being prone to a violation of the assumption of no unmeasured confounders when important covariates are ignored. Although it is possible to limit overfitting by using penalization strategies, an alternative approach is to adopt evidence synthesis. Aggregating previously published prognostic models may improve the generalizability of PGS, while taking account of a large set of covariates-even when limited individual participant data are available. In this article, we extend a method for prediction model aggregation to PGS analysis in nonrandomized studies. We conduct extensive simulations to assess the validity of model aggregation, compared with other methods of PGS analysis for estimating marginal treatment effects. We show that aggregating existing PGS into a "meta-score" is robust to misspecification, even when elementary scores wrongfully omit confounders or focus on different outcomes. We illustrate our methods in a setting of treatments for asthma.
first_indexed 2024-03-06T19:32:46Z
format Journal article
id oxford-uuid:1e094219-0908-4240-a9ce-994e1f539424
institution University of Oxford
language English
last_indexed 2024-03-06T19:32:46Z
publishDate 2020
publisher Wiley
record_format dspace
spelling oxford-uuid:1e094219-0908-4240-a9ce-994e1f5394242022-03-26T11:14:10ZOn the aggregation of published prognostic scores for causal inference in observational studiesJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:1e094219-0908-4240-a9ce-994e1f539424EnglishSymplectic ElementsWiley2020Nguyen, T-LCollins, GSPellegrini, FMoons, KGMDebray, TPAAs real world evidence on drug efficacy involves nonrandomized studies, statistical methods adjusting for confounding are needed. In this context, prognostic score (PGS) analysis has recently been proposed as a method for causal inference. It aims to restore balance across the different treatment groups by identifying subjects with a similar prognosis for a given reference exposure ("control"). This requires the development of a multivariable prognostic model in the control arm of the study sample, which is then extrapolated to the different treatment arms. Unfortunately, large cohorts for developing prognostic models are not always available. Prognostic models are therefore subject to a dilemma between overfitting and parsimony; the latter being prone to a violation of the assumption of no unmeasured confounders when important covariates are ignored. Although it is possible to limit overfitting by using penalization strategies, an alternative approach is to adopt evidence synthesis. Aggregating previously published prognostic models may improve the generalizability of PGS, while taking account of a large set of covariates-even when limited individual participant data are available. In this article, we extend a method for prediction model aggregation to PGS analysis in nonrandomized studies. We conduct extensive simulations to assess the validity of model aggregation, compared with other methods of PGS analysis for estimating marginal treatment effects. We show that aggregating existing PGS into a "meta-score" is robust to misspecification, even when elementary scores wrongfully omit confounders or focus on different outcomes. We illustrate our methods in a setting of treatments for asthma.
spellingShingle Nguyen, T-L
Collins, GS
Pellegrini, F
Moons, KGM
Debray, TPA
On the aggregation of published prognostic scores for causal inference in observational studies
title On the aggregation of published prognostic scores for causal inference in observational studies
title_full On the aggregation of published prognostic scores for causal inference in observational studies
title_fullStr On the aggregation of published prognostic scores for causal inference in observational studies
title_full_unstemmed On the aggregation of published prognostic scores for causal inference in observational studies
title_short On the aggregation of published prognostic scores for causal inference in observational studies
title_sort on the aggregation of published prognostic scores for causal inference in observational studies
work_keys_str_mv AT nguyentl ontheaggregationofpublishedprognosticscoresforcausalinferenceinobservationalstudies
AT collinsgs ontheaggregationofpublishedprognosticscoresforcausalinferenceinobservationalstudies
AT pellegrinif ontheaggregationofpublishedprognosticscoresforcausalinferenceinobservationalstudies
AT moonskgm ontheaggregationofpublishedprognosticscoresforcausalinferenceinobservationalstudies
AT debraytpa ontheaggregationofpublishedprognosticscoresforcausalinferenceinobservationalstudies