Consistency, completeness and external validity of ethnicity recording in NHS primary care records: a cohort study in 25 million patients’ records at source using OpenSAFELY

Background: Ethnicity is known to be an important correlate of health outcomes, particularly during the COVID-19 pandemic, where some ethnic groups were shown to be at higher risk of infection and adverse outcomes. The recording of patients’ ethnic groups in primary care can support research and eff...

Full description

Bibliographic Details
Main Authors: Andrews, CD, Mathur, R, Massey, J, Park, R, Curtis, HJ, Hopcroft, L, Mehrkar, A, Bacon, S, Hickman, G, Smith, R, Evans, D, Ward, T, Davy, S, Inglesby, P, Dillingham, I, Maude, S, O’Dwyer, T, Butler-Cole, BFC, Bridges, L, Bates, C, Parry, J, Hester, F, Harper, S, Cockburn, J, Goldacre, B, MacKenna, B, Walker, AJ, Hulme, WJ
Format: Journal article
Language:English
Published: BioMed Central 2024
_version_ 1811139978533535744
author Andrews, CD
Mathur, R
Massey, J
Park, R
Curtis, HJ
Hopcroft, L
Mehrkar, A
Bacon, S
Hickman, G
Smith, R
Evans, D
Ward, T
Davy, S
Inglesby, P
Dillingham, I
Maude, S
O’Dwyer, T
Butler-Cole, BFC
Bridges, L
Bates, C
Parry, J
Hester, F
Harper, S
Cockburn, J
Goldacre, B
MacKenna, B
Walker, AJ
Hulme, WJ
author_facet Andrews, CD
Mathur, R
Massey, J
Park, R
Curtis, HJ
Hopcroft, L
Mehrkar, A
Bacon, S
Hickman, G
Smith, R
Evans, D
Ward, T
Davy, S
Inglesby, P
Dillingham, I
Maude, S
O’Dwyer, T
Butler-Cole, BFC
Bridges, L
Bates, C
Parry, J
Hester, F
Harper, S
Cockburn, J
Goldacre, B
MacKenna, B
Walker, AJ
Hulme, WJ
author_sort Andrews, CD
collection OXFORD
description Background: Ethnicity is known to be an important correlate of health outcomes, particularly during the COVID-19 pandemic, where some ethnic groups were shown to be at higher risk of infection and adverse outcomes. The recording of patients’ ethnic groups in primary care can support research and efforts to achieve equity in service provision and outcomes; however, the coding of ethnicity is known to present complex challenges. We therefore set out to describe ethnicity coding in detail with a view to supporting the use of this data in a wide range of settings, as part of wider efforts to robustly describe and define methods of using administrative data. Methods: We describe the completeness and consistency of primary care ethnicity recording in the OpenSAFELY-TPP database, containing linked primary care and hospital records in > 25 million patients in England. We also compared the ethnic breakdown in OpenSAFELY-TPP with that of the 2021 UK census. Results: 78.2% of patients registered in OpenSAFELY-TPP on 1 January 2022 had their ethnicity recorded in primary care records, rising to 92.5% when supplemented with hospital data. The completeness of ethnicity recording was higher for women than for men. The rate of primary care ethnicity recording ranged from 77% in the South East of England to 82.2% in the West Midlands. Ethnicity recording rates were higher in patients with chronic or other serious health conditions. For each of the five broad ethnicity groups, primary care recorded ethnicity was within 2.9 percentage points of the population rate as recorded in the 2021 Census for England as a whole. For patients with multiple ethnicity records, 98.7% of the latest recorded ethnicities matched the most frequently coded ethnicity. Patients whose latest recorded ethnicity was categorised as Other were most likely to have a discordant ethnicity recording (32.2%). Conclusions: Primary care ethnicity data in OpenSAFELY is present for over three quarters of all patients, and combined with data from other sources can achieve a high level of completeness. The overall distribution of ethnicities across all English OpenSAFELY-TPP practices was similar to the 2021 Census, with some regional variation. This report identifies the best available codelist for use in OpenSAFELY and similar electronic health record data.
first_indexed 2024-09-25T04:14:40Z
format Journal article
id oxford-uuid:f14c6bd3-7332-44f2-a98f-16398c5ca096
institution University of Oxford
language English
last_indexed 2024-09-25T04:14:40Z
publishDate 2024
publisher BioMed Central
record_format dspace
spelling oxford-uuid:f14c6bd3-7332-44f2-a98f-16398c5ca0962024-07-10T20:22:32ZConsistency, completeness and external validity of ethnicity recording in NHS primary care records: a cohort study in 25 million patients’ records at source using OpenSAFELYJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:f14c6bd3-7332-44f2-a98f-16398c5ca096EnglishJisc Publications RouterBioMed Central2024Andrews, CDMathur, RMassey, JPark, RCurtis, HJHopcroft, LMehrkar, ABacon, SHickman, GSmith, REvans, DWard, TDavy, SInglesby, PDillingham, IMaude, SO’Dwyer, TButler-Cole, BFCBridges, LBates, CParry, JHester, FHarper, SCockburn, JGoldacre, BMacKenna, BWalker, AJHulme, WJBackground: Ethnicity is known to be an important correlate of health outcomes, particularly during the COVID-19 pandemic, where some ethnic groups were shown to be at higher risk of infection and adverse outcomes. The recording of patients’ ethnic groups in primary care can support research and efforts to achieve equity in service provision and outcomes; however, the coding of ethnicity is known to present complex challenges. We therefore set out to describe ethnicity coding in detail with a view to supporting the use of this data in a wide range of settings, as part of wider efforts to robustly describe and define methods of using administrative data. Methods: We describe the completeness and consistency of primary care ethnicity recording in the OpenSAFELY-TPP database, containing linked primary care and hospital records in > 25 million patients in England. We also compared the ethnic breakdown in OpenSAFELY-TPP with that of the 2021 UK census. Results: 78.2% of patients registered in OpenSAFELY-TPP on 1 January 2022 had their ethnicity recorded in primary care records, rising to 92.5% when supplemented with hospital data. The completeness of ethnicity recording was higher for women than for men. The rate of primary care ethnicity recording ranged from 77% in the South East of England to 82.2% in the West Midlands. Ethnicity recording rates were higher in patients with chronic or other serious health conditions. For each of the five broad ethnicity groups, primary care recorded ethnicity was within 2.9 percentage points of the population rate as recorded in the 2021 Census for England as a whole. For patients with multiple ethnicity records, 98.7% of the latest recorded ethnicities matched the most frequently coded ethnicity. Patients whose latest recorded ethnicity was categorised as Other were most likely to have a discordant ethnicity recording (32.2%). Conclusions: Primary care ethnicity data in OpenSAFELY is present for over three quarters of all patients, and combined with data from other sources can achieve a high level of completeness. The overall distribution of ethnicities across all English OpenSAFELY-TPP practices was similar to the 2021 Census, with some regional variation. This report identifies the best available codelist for use in OpenSAFELY and similar electronic health record data.
spellingShingle Andrews, CD
Mathur, R
Massey, J
Park, R
Curtis, HJ
Hopcroft, L
Mehrkar, A
Bacon, S
Hickman, G
Smith, R
Evans, D
Ward, T
Davy, S
Inglesby, P
Dillingham, I
Maude, S
O’Dwyer, T
Butler-Cole, BFC
Bridges, L
Bates, C
Parry, J
Hester, F
Harper, S
Cockburn, J
Goldacre, B
MacKenna, B
Walker, AJ
Hulme, WJ
Consistency, completeness and external validity of ethnicity recording in NHS primary care records: a cohort study in 25 million patients’ records at source using OpenSAFELY
title Consistency, completeness and external validity of ethnicity recording in NHS primary care records: a cohort study in 25 million patients’ records at source using OpenSAFELY
title_full Consistency, completeness and external validity of ethnicity recording in NHS primary care records: a cohort study in 25 million patients’ records at source using OpenSAFELY
title_fullStr Consistency, completeness and external validity of ethnicity recording in NHS primary care records: a cohort study in 25 million patients’ records at source using OpenSAFELY
title_full_unstemmed Consistency, completeness and external validity of ethnicity recording in NHS primary care records: a cohort study in 25 million patients’ records at source using OpenSAFELY
title_short Consistency, completeness and external validity of ethnicity recording in NHS primary care records: a cohort study in 25 million patients’ records at source using OpenSAFELY
title_sort consistency completeness and external validity of ethnicity recording in nhs primary care records a cohort study in 25 million patients records at source using opensafely
work_keys_str_mv AT andrewscd consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT mathurr consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT masseyj consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT parkr consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT curtishj consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT hopcroftl consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT mehrkara consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT bacons consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT hickmang consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT smithr consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT evansd consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT wardt consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT davys consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT inglesbyp consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT dillinghami consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT maudes consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT odwyert consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT butlercolebfc consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT bridgesl consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT batesc consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT parryj consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT hesterf consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT harpers consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT cockburnj consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT goldacreb consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT mackennab consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT walkeraj consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely
AT hulmewj consistencycompletenessandexternalvalidityofethnicityrecordinginnhsprimarycarerecordsacohortstudyin25millionpatientsrecordsatsourceusingopensafely