Bespoke automated linkage to enable analysis of covid deaths by ethnicity.

In early 2020 there was intense media speculation that ethnicity and Covid-19 deaths were correlated. However, the existing method of adding ethnicity to death records resulted in low linkage rates for very recent deaths. We designed and implemented a bespoke linkage in three days enabling accurate...

Full description

Bibliographic Details
Main Authors: Shelley Gammon, Rachel Shipsey, Charlie Tomlin, Josie Plachta
Format: Article
Language:English
Published: Swansea University 2022-08-01
Series:International Journal of Population Data Science
Subjects:
Online Access:https://ijpds.org/article/view/2050
_version_ 1797428837990006784
author Shelley Gammon
Rachel Shipsey
Charlie Tomlin
Josie Plachta
author_facet Shelley Gammon
Rachel Shipsey
Charlie Tomlin
Josie Plachta
author_sort Shelley Gammon
collection DOAJ
description In early 2020 there was intense media speculation that ethnicity and Covid-19 deaths were correlated. However, the existing method of adding ethnicity to death records resulted in low linkage rates for very recent deaths. We designed and implemented a bespoke linkage in three days enabling accurate reporting to the nation. We linked the 2011 England and Wales Census to death records using a range of personal identifiers. Due to time pressure, we focused on executing a single linkage method well. Deterministic linkage was chosen, using a variety of matchkeys which were tested via clerical review. To overcome the issue of addresses changing since 2011, we also linked 2020 death record residuals to the 2019 Patient Register (PR) and then made use of the 2011 PR address where it existed.  This additionally provided an indication of whether unmatched death records might be attributable to migration into England and Wales post-2011. The prior linking method used NHS Number only. Although the overall linkage rate was approximately 90%, the rate for recent deaths (2nd March 2020 to 10th April 2020 in the first iteration of the linkage) was closer to 30% due to an administrative lag in adding NHS Numbers to death records. Our novel bespoke linkage method linked over 39,000 extra death records. Whilst this had minimal impact on the overall linkage rate, it improved the linkage rate for recent deaths to approximately 90%. This was without an impact on accuracy: clerical review demonstrated that the false positive rate was approximately 0.2%. A report was published using this data showing that the risk of death involving Covid-19 among some ethnic groups was significantly higher than others. Determining whether Covid-19 disproportionally affected certain ethnicities was of crucial importance in the early phase of the pandemic to enable appropriate government strategies to be developed. We delivered a bespoke linkage under an exceptional time-limit without compromising on accuracy, enabling this impactful analysis with nation-wide interest and impact.
first_indexed 2024-03-09T09:04:36Z
format Article
id doaj.art-e6271681de284e199f60f4971d064bca
institution Directory Open Access Journal
issn 2399-4908
language English
last_indexed 2024-03-09T09:04:36Z
publishDate 2022-08-01
publisher Swansea University
record_format Article
series International Journal of Population Data Science
spelling doaj.art-e6271681de284e199f60f4971d064bca2023-12-02T10:47:16ZengSwansea UniversityInternational Journal of Population Data Science2399-49082022-08-017310.23889/ijpds.v7i3.2050Bespoke automated linkage to enable analysis of covid deaths by ethnicity.Shelley Gammon0Rachel Shipsey1Charlie Tomlin2Josie Plachta3Office for National StatisticsOffice for National StatisticsOffice for National StatisticsOffice for National Statistics In early 2020 there was intense media speculation that ethnicity and Covid-19 deaths were correlated. However, the existing method of adding ethnicity to death records resulted in low linkage rates for very recent deaths. We designed and implemented a bespoke linkage in three days enabling accurate reporting to the nation. We linked the 2011 England and Wales Census to death records using a range of personal identifiers. Due to time pressure, we focused on executing a single linkage method well. Deterministic linkage was chosen, using a variety of matchkeys which were tested via clerical review. To overcome the issue of addresses changing since 2011, we also linked 2020 death record residuals to the 2019 Patient Register (PR) and then made use of the 2011 PR address where it existed.  This additionally provided an indication of whether unmatched death records might be attributable to migration into England and Wales post-2011. The prior linking method used NHS Number only. Although the overall linkage rate was approximately 90%, the rate for recent deaths (2nd March 2020 to 10th April 2020 in the first iteration of the linkage) was closer to 30% due to an administrative lag in adding NHS Numbers to death records. Our novel bespoke linkage method linked over 39,000 extra death records. Whilst this had minimal impact on the overall linkage rate, it improved the linkage rate for recent deaths to approximately 90%. This was without an impact on accuracy: clerical review demonstrated that the false positive rate was approximately 0.2%. A report was published using this data showing that the risk of death involving Covid-19 among some ethnic groups was significantly higher than others. Determining whether Covid-19 disproportionally affected certain ethnicities was of crucial importance in the early phase of the pandemic to enable appropriate government strategies to be developed. We delivered a bespoke linkage under an exceptional time-limit without compromising on accuracy, enabling this impactful analysis with nation-wide interest and impact. https://ijpds.org/article/view/2050bespoke linkagecovid-19ethnicity
spellingShingle Shelley Gammon
Rachel Shipsey
Charlie Tomlin
Josie Plachta
Bespoke automated linkage to enable analysis of covid deaths by ethnicity.
International Journal of Population Data Science
bespoke linkage
covid-19
ethnicity
title Bespoke automated linkage to enable analysis of covid deaths by ethnicity.
title_full Bespoke automated linkage to enable analysis of covid deaths by ethnicity.
title_fullStr Bespoke automated linkage to enable analysis of covid deaths by ethnicity.
title_full_unstemmed Bespoke automated linkage to enable analysis of covid deaths by ethnicity.
title_short Bespoke automated linkage to enable analysis of covid deaths by ethnicity.
title_sort bespoke automated linkage to enable analysis of covid deaths by ethnicity
topic bespoke linkage
covid-19
ethnicity
url https://ijpds.org/article/view/2050
work_keys_str_mv AT shelleygammon bespokeautomatedlinkagetoenableanalysisofcoviddeathsbyethnicity
AT rachelshipsey bespokeautomatedlinkagetoenableanalysisofcoviddeathsbyethnicity
AT charlietomlin bespokeautomatedlinkagetoenableanalysisofcoviddeathsbyethnicity
AT josieplachta bespokeautomatedlinkagetoenableanalysisofcoviddeathsbyethnicity