A robust imputation method for missing responses and covariates in sample selection models

Sample selection arises when the outcome of interest is partially observed in a study. Although sophisticated statistical methods in the parametric and non-parametric framework have been proposed to solve this problem, it is yet unclear how to deal with selectively missing covariate data using simpl...

Full description

Bibliographic Details
Main Authors: Ogundimu, E, Collins, G
Format: Journal article
Language:English
Published: SAGE Publications 2017
_version_ 1826272616300675072
author Ogundimu, E
Collins, G
author_facet Ogundimu, E
Collins, G
author_sort Ogundimu, E
collection OXFORD
description Sample selection arises when the outcome of interest is partially observed in a study. Although sophisticated statistical methods in the parametric and non-parametric framework have been proposed to solve this problem, it is yet unclear how to deal with selectively missing covariate data using simple multiple imputation techniques, especially in the absence of exclusion restrictions and deviation from normality. Motivated by the 2003-2004 NHANES data, where previous authors have studied the effect of socio-economic status on blood pressure with missing data on income variable, we proposed the use of a robust imputation technique based on the selection-t sample selection model. The imputation method, which is developed within the frequentist framework, is compared with competing alternatives in a simulation study. The results indicate that the robust alternative is not susceptible to the absence of exclusion restrictions - a property inherited from the parent selection-t model - and performs better than models based on the normal assumption even when the data is generated from the normal distribution. Applications to missing outcome and covariate data further corroborate the robustness properties of the proposed method. We implemented the proposed approach within the MICE environment in R Statistical Software.
first_indexed 2024-03-06T22:15:23Z
format Journal article
id oxford-uuid:533a6c08-c1e6-41db-a0c9-34bd078d1b5d
institution University of Oxford
language English
last_indexed 2024-03-06T22:15:23Z
publishDate 2017
publisher SAGE Publications
record_format dspace
spelling oxford-uuid:533a6c08-c1e6-41db-a0c9-34bd078d1b5d2022-03-26T16:30:19ZA robust imputation method for missing responses and covariates in sample selection modelsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:533a6c08-c1e6-41db-a0c9-34bd078d1b5dEnglishSymplectic Elements at OxfordSAGE Publications2017Ogundimu, ECollins, GSample selection arises when the outcome of interest is partially observed in a study. Although sophisticated statistical methods in the parametric and non-parametric framework have been proposed to solve this problem, it is yet unclear how to deal with selectively missing covariate data using simple multiple imputation techniques, especially in the absence of exclusion restrictions and deviation from normality. Motivated by the 2003-2004 NHANES data, where previous authors have studied the effect of socio-economic status on blood pressure with missing data on income variable, we proposed the use of a robust imputation technique based on the selection-t sample selection model. The imputation method, which is developed within the frequentist framework, is compared with competing alternatives in a simulation study. The results indicate that the robust alternative is not susceptible to the absence of exclusion restrictions - a property inherited from the parent selection-t model - and performs better than models based on the normal assumption even when the data is generated from the normal distribution. Applications to missing outcome and covariate data further corroborate the robustness properties of the proposed method. We implemented the proposed approach within the MICE environment in R Statistical Software.
spellingShingle Ogundimu, E
Collins, G
A robust imputation method for missing responses and covariates in sample selection models
title A robust imputation method for missing responses and covariates in sample selection models
title_full A robust imputation method for missing responses and covariates in sample selection models
title_fullStr A robust imputation method for missing responses and covariates in sample selection models
title_full_unstemmed A robust imputation method for missing responses and covariates in sample selection models
title_short A robust imputation method for missing responses and covariates in sample selection models
title_sort robust imputation method for missing responses and covariates in sample selection models
work_keys_str_mv AT ogundimue arobustimputationmethodformissingresponsesandcovariatesinsampleselectionmodels
AT collinsg arobustimputationmethodformissingresponsesandcovariatesinsampleselectionmodels
AT ogundimue robustimputationmethodformissingresponsesandcovariatesinsampleselectionmodels
AT collinsg robustimputationmethodformissingresponsesandcovariatesinsampleselectionmodels