Multiple imputation to deal with missing EQ-5D-3L data: Should we impute individual domains or the actual index?

PURPOSE: Missing data are a well-known and widely documented problem in cost-effectiveness analyses alongside clinical trials using individual patient-level data. Current methodological research recommends multiple imputation (MI) to deal with missing health outcome data, but there is little guidanc...

Full description

Bibliographic Details
Main Authors: Simons, C, Rivero-Arias, O, Yu, L, Simon, J
Format: Journal article
Language:English
Published: Springer International Publishing 2015
_version_ 1797101372630368256
author Simons, C
Rivero-Arias, O
Yu, L
Simon, J
author_facet Simons, C
Rivero-Arias, O
Yu, L
Simon, J
author_sort Simons, C
collection OXFORD
description PURPOSE: Missing data are a well-known and widely documented problem in cost-effectiveness analyses alongside clinical trials using individual patient-level data. Current methodological research recommends multiple imputation (MI) to deal with missing health outcome data, but there is little guidance on whether MI for multi-attribute questionnaires, such as the EQ-5D-3L, should be carried out at domain or at summary score level. In this paper, we evaluated the impact of imputing individual domains versus imputing index values to deal with missing EQ-5D-3L data using a simulation study and developed recommendations for future practice. METHODS: We simulated missing data in a patient-level dataset with complete EQ-5D-3L data at one point in time from a large multinational clinical trial (n = 1,814). Different proportions of missing data were generated using a missing at random (MAR) mechanism and three different scenarios were studied. The performance of using each method was evaluated using root mean squared error and mean absolute error of the actual versus predicted EQ-5D-3L indices. RESULTS: In large sample sizes (n > 500) and a missing data pattern that follows mainly unit non-response, imputing domains or the index produced similar results. However, domain imputation became more accurate than index imputation with pattern of missingness following an item non-response. For smaller sample sizes (n < 100), index imputation was more accurate. When MI models were misspecified, both domain and index imputations were inaccurate for any proportion of missing data. CONCLUSIONS: The decision between imputing the domains or the EQ-5D-3L index scores depends on the observed missing data pattern and the sample size available for analysis. Analysts conducting this type of exercises should also evaluate the sensitivity of the analysis to the MAR assumption and whether the imputation model is correctly specified.
first_indexed 2024-03-07T05:50:57Z
format Journal article
id oxford-uuid:e8dbd58f-f70f-41b8-92f4-c55bf543ca74
institution University of Oxford
language English
last_indexed 2024-03-07T05:50:57Z
publishDate 2015
publisher Springer International Publishing
record_format dspace
spelling oxford-uuid:e8dbd58f-f70f-41b8-92f4-c55bf543ca742022-03-27T10:49:55ZMultiple imputation to deal with missing EQ-5D-3L data: Should we impute individual domains or the actual index?Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:e8dbd58f-f70f-41b8-92f4-c55bf543ca74EnglishSymplectic Elements at OxfordSpringer International Publishing2015Simons, CRivero-Arias, OYu, LSimon, JPURPOSE: Missing data are a well-known and widely documented problem in cost-effectiveness analyses alongside clinical trials using individual patient-level data. Current methodological research recommends multiple imputation (MI) to deal with missing health outcome data, but there is little guidance on whether MI for multi-attribute questionnaires, such as the EQ-5D-3L, should be carried out at domain or at summary score level. In this paper, we evaluated the impact of imputing individual domains versus imputing index values to deal with missing EQ-5D-3L data using a simulation study and developed recommendations for future practice. METHODS: We simulated missing data in a patient-level dataset with complete EQ-5D-3L data at one point in time from a large multinational clinical trial (n = 1,814). Different proportions of missing data were generated using a missing at random (MAR) mechanism and three different scenarios were studied. The performance of using each method was evaluated using root mean squared error and mean absolute error of the actual versus predicted EQ-5D-3L indices. RESULTS: In large sample sizes (n > 500) and a missing data pattern that follows mainly unit non-response, imputing domains or the index produced similar results. However, domain imputation became more accurate than index imputation with pattern of missingness following an item non-response. For smaller sample sizes (n < 100), index imputation was more accurate. When MI models were misspecified, both domain and index imputations were inaccurate for any proportion of missing data. CONCLUSIONS: The decision between imputing the domains or the EQ-5D-3L index scores depends on the observed missing data pattern and the sample size available for analysis. Analysts conducting this type of exercises should also evaluate the sensitivity of the analysis to the MAR assumption and whether the imputation model is correctly specified.
spellingShingle Simons, C
Rivero-Arias, O
Yu, L
Simon, J
Multiple imputation to deal with missing EQ-5D-3L data: Should we impute individual domains or the actual index?
title Multiple imputation to deal with missing EQ-5D-3L data: Should we impute individual domains or the actual index?
title_full Multiple imputation to deal with missing EQ-5D-3L data: Should we impute individual domains or the actual index?
title_fullStr Multiple imputation to deal with missing EQ-5D-3L data: Should we impute individual domains or the actual index?
title_full_unstemmed Multiple imputation to deal with missing EQ-5D-3L data: Should we impute individual domains or the actual index?
title_short Multiple imputation to deal with missing EQ-5D-3L data: Should we impute individual domains or the actual index?
title_sort multiple imputation to deal with missing eq 5d 3l data should we impute individual domains or the actual index
work_keys_str_mv AT simonsc multipleimputationtodealwithmissingeq5d3ldatashouldweimputeindividualdomainsortheactualindex
AT riveroariaso multipleimputationtodealwithmissingeq5d3ldatashouldweimputeindividualdomainsortheactualindex
AT yul multipleimputationtodealwithmissingeq5d3ldatashouldweimputeindividualdomainsortheactualindex
AT simonj multipleimputationtodealwithmissingeq5d3ldatashouldweimputeindividualdomainsortheactualindex