Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021

This paper describes linkage of Homelessness Case Level Collection (H-CLIC) data to Census 2021, allowing researchers to better understand households experiencing and at risk of homelessness. This fits within a wider portfolio of Better Outcomes Through Linked Data (BOLD) pilot projects, developing...

Full description

Bibliographic Details
Main Authors: Dilly Stephenson, Jen Hampton
Format: Article
Language:English
Published: Swansea University 2023-09-01
Series:International Journal of Population Data Science
Online Access:https://ijpds.org/article/view/2317
_version_ 1827607385828491264
author Dilly Stephenson
Jen Hampton
author_facet Dilly Stephenson
Jen Hampton
author_sort Dilly Stephenson
collection DOAJ
description This paper describes linkage of Homelessness Case Level Collection (H-CLIC) data to Census 2021, allowing researchers to better understand households experiencing and at risk of homelessness. This fits within a wider portfolio of Better Outcomes Through Linked Data (BOLD) pilot projects, developing research possibilities and partnerships within the homelessness research sphere. To mitigate impact from limited and lower quality variables, linkage was performed in two phases: individual-level linkage and associative matching. Individual-level linkage combined deterministic and probabilistic linkage methods, supported using Splink, with two stages of probabilistic linkage with differing extents of geographical blocking. Clerical-focused associative matching was employed following individual linkage, using UPRN and household make-up. The project took a pipeline approach, with consideration to future-proofing the pipeline to facilitate annual updates to the H-CLIC spine. Also considered in detail are the quality implications of differing temporal data in linking dynamic H-CLIC data to static Census data. The linkage resulted in construction of a research-ready dataset, which enables H-CLIC applicants to be joined to their Census records. To compensate for limited address information and support best use of processing resources, blocking was used to limit search spaces in the probabilistic linkage. Households with missing links between individual members were successfully identified and passed through associative matching methods. Though matching was successful, the remaining residuals were challenging to deal with. Error introduced by the linkage methodology is presented via linkage quality metrics and bias analysis. Bias introduced by the linkage processing may be of particular concern, particularly considering coverage of these sample data. Within the fifty-three current Local Authorities included, a skew was observed toward smaller, southernly areas with lower homelessness rates. The linked dataset enables research into households vulnerable to homelessness, their movements across England, and their exposure to repeated homelessness. The data facilitates research into this important topic, supporting evidence-based policymaking. Furthermore, such use is anticipated to encourage further local authorities to provide their data, broadening its reach and our understanding.
first_indexed 2024-03-09T06:52:43Z
format Article
id doaj.art-f64e51297e914f38a9f80e9be439ba1a
institution Directory Open Access Journal
issn 2399-4908
language English
last_indexed 2024-03-09T06:52:43Z
publishDate 2023-09-01
publisher Swansea University
record_format Article
series International Journal of Population Data Science
spelling doaj.art-f64e51297e914f38a9f80e9be439ba1a2023-12-03T10:22:14ZengSwansea UniversityInternational Journal of Population Data Science2399-49082023-09-018210.23889/ijpds.v8i2.2317Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021Dilly Stephenson0Jen Hampton1Office for National Statistics, Newport, United KingdomOffice for National Statistics, Newport, United Kingdom This paper describes linkage of Homelessness Case Level Collection (H-CLIC) data to Census 2021, allowing researchers to better understand households experiencing and at risk of homelessness. This fits within a wider portfolio of Better Outcomes Through Linked Data (BOLD) pilot projects, developing research possibilities and partnerships within the homelessness research sphere. To mitigate impact from limited and lower quality variables, linkage was performed in two phases: individual-level linkage and associative matching. Individual-level linkage combined deterministic and probabilistic linkage methods, supported using Splink, with two stages of probabilistic linkage with differing extents of geographical blocking. Clerical-focused associative matching was employed following individual linkage, using UPRN and household make-up. The project took a pipeline approach, with consideration to future-proofing the pipeline to facilitate annual updates to the H-CLIC spine. Also considered in detail are the quality implications of differing temporal data in linking dynamic H-CLIC data to static Census data. The linkage resulted in construction of a research-ready dataset, which enables H-CLIC applicants to be joined to their Census records. To compensate for limited address information and support best use of processing resources, blocking was used to limit search spaces in the probabilistic linkage. Households with missing links between individual members were successfully identified and passed through associative matching methods. Though matching was successful, the remaining residuals were challenging to deal with. Error introduced by the linkage methodology is presented via linkage quality metrics and bias analysis. Bias introduced by the linkage processing may be of particular concern, particularly considering coverage of these sample data. Within the fifty-three current Local Authorities included, a skew was observed toward smaller, southernly areas with lower homelessness rates. The linked dataset enables research into households vulnerable to homelessness, their movements across England, and their exposure to repeated homelessness. The data facilitates research into this important topic, supporting evidence-based policymaking. Furthermore, such use is anticipated to encourage further local authorities to provide their data, broadening its reach and our understanding. https://ijpds.org/article/view/2317
spellingShingle Dilly Stephenson
Jen Hampton
Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021
International Journal of Population Data Science
title Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021
title_full Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021
title_fullStr Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021
title_full_unstemmed Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021
title_short Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021
title_sort exploring households experiencing and at risk of homelessness linking homelessness case level data to census 2021
url https://ijpds.org/article/view/2317
work_keys_str_mv AT dillystephenson exploringhouseholdsexperiencingandatriskofhomelessnesslinkinghomelessnesscaseleveldatatocensus2021
AT jenhampton exploringhouseholdsexperiencingandatriskofhomelessnesslinkinghomelessnesscaseleveldatatocensus2021