Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing?
Background: Missing outcomes can seriously impair the ability to make correct inferences from randomized controlled trials (RCTs). Complete case (CC) analysis is commonly used, but it reduces sample size and is perceived to lead to reduced statistical efficiency of esti...
Main Authors: | Mukaka, M; White, SA; Terlouw, DJ; Mwapasa, V; Kalilani-Phiri, L; Faragher, EB |
---|---|
Format: | Journal article |
Language: | English |
Published: | BioMed Central, 2016 |
_version_ | 1826260324503781376 |
---|---|
author | Mukaka, M White, SA Terlouw, DJ Mwapasa, V Kalilani-Phiri, L Faragher, EB |
author_sort | Mukaka, M |
collection | OXFORD |
description | Background: Missing outcomes can seriously impair the ability to make correct inferences from randomized controlled trials (RCTs). Complete case (CC) analysis is commonly used, but it reduces sample size and is perceived to lead to reduced statistical efficiency of estimates while increasing the potential for bias. As multiple imputation (MI) methods preserve sample size, they are generally viewed as the preferred analytical approach. We examined this assumption, comparing the performance of CC and MI methods to determine risk difference (RD) estimates in the presence of missing binary outcomes. We conducted simulation studies of 5000 simulated data sets with 50 imputations of RCTs with one primary follow-up endpoint at different underlying levels of RD (3–25%) and missing outcomes (5–30%). Results: For missing at random (MAR) or missing completely at random (MCAR) outcomes, CC method estimates generally remained unbiased and achieved precision similar to or better than MI methods, and high statistical coverage. Missing not at random (MNAR) scenarios yielded invalid inferences with both methods. Effect size estimate bias was reduced in MI methods by always including group membership even if this was unrelated to missingness. Surprisingly, under MAR and MCAR conditions in the assessed scenarios, MI offered no statistical advantage over CC methods. Conclusions: While MI must inherently accompany CC methods for intention-to-treat analyses, these findings endorse CC methods for per protocol risk difference analyses in these conditions. These findings provide an argument for the use of the CC approach to always complement MI analyses, with the usual caveat that the validity of the mechanism for missingness be thoroughly discussed. More importantly, researchers should strive to collect as much data as possible. |
first_indexed | 2024-03-06T19:03:49Z |
format | Journal article |
id | oxford-uuid:1471adbe-635d-4a29-93e8-bd8cc38e9377 |
institution | University of Oxford |
language | English |
last_indexed | 2024-03-06T19:03:49Z |
publishDate | 2016 |
publisher | BioMed Central |
record_format | dspace |
title | Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing? |
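
The comparison described in the abstract can be illustrated with a small simulation. The following is a minimal sketch only, not the authors' simulation code: the function names, the sample size, the event risks, the 20% missingness rate and the Beta-posterior imputation model are all assumptions chosen for illustration. It generates two-arm trials with a binary outcome that is missing completely at random (MCAR), estimates the risk difference by complete case analysis and by a simple within-arm multiple imputation (so that group membership is part of the imputation model), and pools the imputed estimates with Rubin's rules.

```python
import random
from statistics import mean, variance


def simulate_trial(n_per_arm, p_control, p_treat, p_missing, rng):
    """Return (arm, outcome) pairs; outcome is None when missing (MCAR)."""
    data = []
    for arm, p in ((0, p_control), (1, p_treat)):
        for _ in range(n_per_arm):
            y = 1 if rng.random() < p else 0
            if rng.random() < p_missing:  # independent of arm and outcome
                y = None
            data.append((arm, y))
    return data


def risk_difference(data):
    """Risk difference (treatment minus control) and its variance."""
    est, var = 0.0, 0.0
    for arm, sign in ((1, 1), (0, -1)):
        ys = [y for a, y in data if a == arm]
        p = sum(ys) / len(ys)
        est += sign * p
        var += p * (1 - p) / len(ys)
    return est, var


def complete_case_rd(data):
    """Drop participants with a missing outcome, then estimate the RD."""
    return risk_difference([(a, y) for a, y in data if y is not None])


def multiple_imputation_rd(data, m, rng):
    """Impute within arm from a Beta posterior draw; pool with Rubin's rules.

    Assumed imputation model (illustrative): uniform prior on each arm's risk.
    Requires m >= 2 so the between-imputation variance is defined.
    """
    ests, variances = [], []
    for _ in range(m):
        risk_draw = {}
        for arm in (0, 1):
            obs = [y for a, y in data if a == arm and y is not None]
            s = sum(obs)
            risk_draw[arm] = rng.betavariate(s + 1, len(obs) - s + 1)
        completed = [
            (a, y if y is not None else int(rng.random() < risk_draw[a]))
            for a, y in data
        ]
        e, v = risk_difference(completed)
        ests.append(e)
        variances.append(v)
    within = mean(variances)
    between = variance(ests)
    return mean(ests), within + (1 + 1 / m) * between


def mean_bias(n_trials, rng, true_rd=0.10):
    """Average bias of the CC and MI estimates over repeated trials."""
    cc, mi = [], []
    for _ in range(n_trials):
        trial = simulate_trial(200, 0.30, 0.30 + true_rd, 0.20, rng)
        cc.append(complete_case_rd(trial)[0])
        mi.append(multiple_imputation_rd(trial, 10, rng)[0])
    return mean(cc) - true_rd, mean(mi) - true_rd


if __name__ == "__main__":
    cc_bias, mi_bias = mean_bias(200, random.Random(2016))
    print(f"CC bias: {cc_bias:+.4f}  MI bias: {mi_bias:+.4f}")
```

Under MCAR both average biases should be close to zero, which is in line with the abstract's statement that MI offered no advantage over CC in such scenarios. The sketch does not cover the MAR or MNAR mechanisms, and it uses far fewer replications and imputations than the 5000 data sets and 50 imputations reported in the abstract.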