Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021
This paper describes linkage of Homelessness Case Level Collection (H-CLIC) data to Census 2021, allowing researchers to better understand households experiencing and at risk of homelessness. This fits within a wider portfolio of Better Outcomes Through Linked Data (BOLD) pilot projects, developing...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Swansea University
2023-09-01
|
Series: | International Journal of Population Data Science |
Online Access: | https://ijpds.org/article/view/2317 |
_version_ | 1827607385828491264 |
---|---|
author | Dilly Stephenson Jen Hampton |
author_facet | Dilly Stephenson Jen Hampton |
author_sort | Dilly Stephenson |
collection | DOAJ |
description |
This paper describes linkage of Homelessness Case Level Collection (H-CLIC) data to Census 2021, allowing researchers to better understand households experiencing and at risk of homelessness. This fits within a wider portfolio of Better Outcomes Through Linked Data (BOLD) pilot projects, developing research possibilities and partnerships within the homelessness research sphere.
To mitigate impact from limited and lower quality variables, linkage was performed in two phases: individual-level linkage and associative matching. Individual-level linkage combined deterministic and probabilistic linkage methods, supported using Splink, with two stages of probabilistic linkage with differing extents of geographical blocking. Clerical-focused associative matching was employed following individual linkage, using UPRN and household make-up.
The project took a pipeline approach, with consideration to future-proofing the pipeline to facilitate annual updates to the H-CLIC spine. Also considered in detail are the quality implications of differing temporal data in linking dynamic H-CLIC data to static Census data.
The linkage resulted in construction of a research-ready dataset, which enables H-CLIC applicants to be joined to their Census records. To compensate for limited address information and support best use of processing resources, blocking was used to limit search spaces in the probabilistic linkage. Households with missing links between individual members were successfully identified and passed through associative matching methods. Though matching was successful, the remaining residuals were challenging to deal with.
Error introduced by the linkage methodology is presented via linkage quality metrics and bias analysis. Bias introduced by the linkage processing may be of particular concern, particularly considering coverage of these sample data. Within the fifty-three current Local Authorities included, a skew was observed toward smaller, southernly areas with lower homelessness rates.
The linked dataset enables research into households vulnerable to homelessness, their movements across England, and their exposure to repeated homelessness. The data facilitates research into this important topic, supporting evidence-based policymaking. Furthermore, such use is anticipated to encourage further local authorities to provide their data, broadening its reach and our understanding.
|
first_indexed | 2024-03-09T06:52:43Z |
format | Article |
id | doaj.art-f64e51297e914f38a9f80e9be439ba1a |
institution | Directory Open Access Journal |
issn | 2399-4908 |
language | English |
last_indexed | 2024-03-09T06:52:43Z |
publishDate | 2023-09-01 |
publisher | Swansea University |
record_format | Article |
series | International Journal of Population Data Science |
spelling | doaj.art-f64e51297e914f38a9f80e9be439ba1a2023-12-03T10:22:14ZengSwansea UniversityInternational Journal of Population Data Science2399-49082023-09-018210.23889/ijpds.v8i2.2317Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021Dilly Stephenson0Jen Hampton1Office for National Statistics, Newport, United KingdomOffice for National Statistics, Newport, United Kingdom This paper describes linkage of Homelessness Case Level Collection (H-CLIC) data to Census 2021, allowing researchers to better understand households experiencing and at risk of homelessness. This fits within a wider portfolio of Better Outcomes Through Linked Data (BOLD) pilot projects, developing research possibilities and partnerships within the homelessness research sphere. To mitigate impact from limited and lower quality variables, linkage was performed in two phases: individual-level linkage and associative matching. Individual-level linkage combined deterministic and probabilistic linkage methods, supported using Splink, with two stages of probabilistic linkage with differing extents of geographical blocking. Clerical-focused associative matching was employed following individual linkage, using UPRN and household make-up. The project took a pipeline approach, with consideration to future-proofing the pipeline to facilitate annual updates to the H-CLIC spine. Also considered in detail are the quality implications of differing temporal data in linking dynamic H-CLIC data to static Census data. The linkage resulted in construction of a research-ready dataset, which enables H-CLIC applicants to be joined to their Census records. To compensate for limited address information and support best use of processing resources, blocking was used to limit search spaces in the probabilistic linkage. Households with missing links between individual members were successfully identified and passed through associative matching methods. Though matching was successful, the remaining residuals were challenging to deal with. Error introduced by the linkage methodology is presented via linkage quality metrics and bias analysis. Bias introduced by the linkage processing may be of particular concern, particularly considering coverage of these sample data. Within the fifty-three current Local Authorities included, a skew was observed toward smaller, southernly areas with lower homelessness rates. The linked dataset enables research into households vulnerable to homelessness, their movements across England, and their exposure to repeated homelessness. The data facilitates research into this important topic, supporting evidence-based policymaking. Furthermore, such use is anticipated to encourage further local authorities to provide their data, broadening its reach and our understanding. https://ijpds.org/article/view/2317 |
spellingShingle | Dilly Stephenson Jen Hampton Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021 International Journal of Population Data Science |
title | Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021 |
title_full | Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021 |
title_fullStr | Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021 |
title_full_unstemmed | Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021 |
title_short | Exploring households experiencing and at risk of homelessness: Linking homelessness case level data to Census 2021 |
title_sort | exploring households experiencing and at risk of homelessness linking homelessness case level data to census 2021 |
url | https://ijpds.org/article/view/2317 |
work_keys_str_mv | AT dillystephenson exploringhouseholdsexperiencingandatriskofhomelessnesslinkinghomelessnesscaseleveldatatocensus2021 AT jenhampton exploringhouseholdsexperiencingandatriskofhomelessnesslinkinghomelessnesscaseleveldatatocensus2021 |