Multiple Imputation of Composite Covariates in Survival Studies

Missing covariate values are a common problem in survival studies, and the method of choice when handling such incomplete data is often multiple imputation. However, it is not obvious how this can be used most effectively when an incomplete covariate is a function of other covariates. For example, b...

Full description

Bibliographic Details
Main Authors: Lily Clements, Alan C. Kimber, Stefanie Biedermann
Format: Article
Language:English
Published: MDPI AG 2022-03-01
Series:Stats
Subjects:
Online Access:https://www.mdpi.com/2571-905X/5/2/20
Description
Summary:Missing covariate values are a common problem in survival studies, and the method of choice when handling such incomplete data is often multiple imputation. However, it is not obvious how this can be used most effectively when an incomplete covariate is a function of other covariates. For example, body mass index (BMI) is the ratio of weight and height-squared. In this situation, the following question arises: Should a composite covariate such as BMI be imputed directly, or is it advantageous to impute its constituents, weight and height, first and to construct BMI afterwards? We address this question through a carefully designed simulation study that compares various approaches to multiple imputation of composite covariates in a survival context. We discuss advantages and limitations of these approaches for various types of missingness and imputation models. Our results are a first step towards providing much needed guidance to practitioners for analysing their incomplete survival data effectively.
ISSN:2571-905X