Donor-based imputation methods for admin data: How to replace the number of rooms question on the Census

The Census White Paper recommends removing the number of rooms question and to utilise administrative data instead. Previous research demonstrated that Valuation Office Agency (VOA) data can be used for this purpose in principal. However, users of Census micro-level data expect a utility dataset wit...

Full description

Bibliographic Details
Main Authors: Stephan Tietz, Andy Mealor, Fern Leather, Ali Dent
Format: Article
Language:English
Published: Swansea University 2019-11-01
Series:International Journal of Population Data Science
Online Access:https://ijpds.org/article/view/1299
_version_ 1797421694204248064
author Stephan Tietz
Andy Mealor
Fern Leather
Ali Dent
author_facet Stephan Tietz
Andy Mealor
Fern Leather
Ali Dent
author_sort Stephan Tietz
collection DOAJ
description The Census White Paper recommends removing the number of rooms question and to utilise administrative data instead. Previous research demonstrated that Valuation Office Agency (VOA) data can be used for this purpose in principal. However, users of Census micro-level data expect a utility dataset without missingness. This research project explored whether VOA number of rooms variable is suitable to undergo edit and imputation (E&I) within the standard census framework (i.e. donor imputation). We examined how linked admin and survey data challenges assumptions underlying the E&I process. We linked the 2011 Census with VOA data and attempted to impute VOA number of rooms using auxiliary variables from the Census to predict the missing and inconsistent values. This includes the question of whether we should allow questionnaire data to be changed where they are inconsistent with admin data. We demonstrated that it is possible to predict VOA number of rooms from census variables, despite some assumptions being partially violated (definitional and time-frame issues, the possibility of subpopulations in the data). This was true for missingness prior to linkage and missingness due to a data linkage failure. Additionally, we successfully tested the design principal of favouring survey data over alternative data where the two are inconsistent. We observed that some local authorities had large percentages of imputed data, and there was an increase in the percentage of properties where VOA number of rooms was equal to census number of bedrooms. This affects end-user interpretation of the data. The result is not a general endorsement that all linked survey-admin data can be effectively treated by standard E&I procedures. However, our research can be used as a blue print for other proof-of-concept studies on imputing admin data.
first_indexed 2024-03-09T07:21:08Z
format Article
id doaj.art-a87d84c18bb841d8b3a96ec5fab9e4b4
institution Directory Open Access Journal
issn 2399-4908
language English
last_indexed 2024-03-09T07:21:08Z
publishDate 2019-11-01
publisher Swansea University
record_format Article
series International Journal of Population Data Science
spelling doaj.art-a87d84c18bb841d8b3a96ec5fab9e4b42023-12-03T07:28:47ZengSwansea UniversityInternational Journal of Population Data Science2399-49082019-11-014310.23889/ijpds.v4i3.1299Donor-based imputation methods for admin data: How to replace the number of rooms question on the CensusStephan Tietz0Andy Mealor1Fern Leather2Ali Dent3Office for National StatisticsOffice for National StatisticsOffice for National StatisticsOffice for National StatisticsThe Census White Paper recommends removing the number of rooms question and to utilise administrative data instead. Previous research demonstrated that Valuation Office Agency (VOA) data can be used for this purpose in principal. However, users of Census micro-level data expect a utility dataset without missingness. This research project explored whether VOA number of rooms variable is suitable to undergo edit and imputation (E&I) within the standard census framework (i.e. donor imputation). We examined how linked admin and survey data challenges assumptions underlying the E&I process. We linked the 2011 Census with VOA data and attempted to impute VOA number of rooms using auxiliary variables from the Census to predict the missing and inconsistent values. This includes the question of whether we should allow questionnaire data to be changed where they are inconsistent with admin data. We demonstrated that it is possible to predict VOA number of rooms from census variables, despite some assumptions being partially violated (definitional and time-frame issues, the possibility of subpopulations in the data). This was true for missingness prior to linkage and missingness due to a data linkage failure. Additionally, we successfully tested the design principal of favouring survey data over alternative data where the two are inconsistent. We observed that some local authorities had large percentages of imputed data, and there was an increase in the percentage of properties where VOA number of rooms was equal to census number of bedrooms. This affects end-user interpretation of the data. The result is not a general endorsement that all linked survey-admin data can be effectively treated by standard E&I procedures. However, our research can be used as a blue print for other proof-of-concept studies on imputing admin data.https://ijpds.org/article/view/1299
spellingShingle Stephan Tietz
Andy Mealor
Fern Leather
Ali Dent
Donor-based imputation methods for admin data: How to replace the number of rooms question on the Census
International Journal of Population Data Science
title Donor-based imputation methods for admin data: How to replace the number of rooms question on the Census
title_full Donor-based imputation methods for admin data: How to replace the number of rooms question on the Census
title_fullStr Donor-based imputation methods for admin data: How to replace the number of rooms question on the Census
title_full_unstemmed Donor-based imputation methods for admin data: How to replace the number of rooms question on the Census
title_short Donor-based imputation methods for admin data: How to replace the number of rooms question on the Census
title_sort donor based imputation methods for admin data how to replace the number of rooms question on the census
url https://ijpds.org/article/view/1299
work_keys_str_mv AT stephantietz donorbasedimputationmethodsforadmindatahowtoreplacethenumberofroomsquestiononthecensus
AT andymealor donorbasedimputationmethodsforadmindatahowtoreplacethenumberofroomsquestiononthecensus
AT fernleather donorbasedimputationmethodsforadmindatahowtoreplacethenumberofroomsquestiononthecensus
AT alident donorbasedimputationmethodsforadmindatahowtoreplacethenumberofroomsquestiononthecensus