The validity of routine individuated programmatic data in HIV surveillance assessed over a 10-year period.

Objectives In the Western Cape, South Africa, public-sector individual-level routine data are consolidated from multiple non-EMR sources through the Provincial Health Data Centre (PHDC).  This enables the description of temporal changes in population-wide antenatal HIV sero-prevalence. We evaluated...

Full description

Bibliographic Details
Main Authors: Nisha Jacob, Brian Rice, Andrew Boulle
Format: Article
Language:English
Published: Swansea University 2022-08-01
Series:International Journal of Population Data Science
Subjects:
Online Access:https://ijpds.org/article/view/1840
_version_ 1797423102228955136
author Nisha Jacob
Brian Rice
Andrew Boulle
author_facet Nisha Jacob
Brian Rice
Andrew Boulle
author_sort Nisha Jacob
collection DOAJ
description Objectives In the Western Cape, South Africa, public-sector individual-level routine data are consolidated from multiple non-EMR sources through the Provincial Health Data Centre (PHDC).  This enables the description of temporal changes in population-wide antenatal HIV sero-prevalence. We evaluated the validity of these data compared to aggregated program data and sentinel surveys. Approach We conducted a retrospective cohort analysis of all pregnancies consolidated in the PHDC from January 2011 to December 2020. Evidence of antenatal and HIV care from electronic platforms (each with its own multi-source phenotype algorithm) were linked using a unique patient identifier. HIV prevalence estimates were triangulated with available sentinel national and provincial antenatal survey estimates as well as aggregated programmatic data from registers as recorded in the District Health Information System. Provincial, district-level and age-group HIV prevalence estimates were compared between data systems using correlation coefficients, absolute differences and trend analysis. Results In total, 982,914 pregnancies were ascertained with a median maternal age of 26.9 years. Between 2011–2013, PHDC HIV prevalence estimates were widely disparate from aggregate and survey data (due to incomplete electronic data) whereas, from 2014 onwards, estimates were more closely correlated to aggregate data estimates (r=0.83; p=0.02) with an average absolute prevalence difference of 0.97%. In keeping with survey and aggregate data trends, PHDC data show a relatively stable provincial HIV prevalence from 17.0 (95%CI 16.8%–17.2%) in 2015 to 18.9% (95%CI 18.7–19.1%) in 2020. The highest HIV prevalence was in the Cape Metro district (20.6%; 95%CI 20.4%–20.9%). Prevalence estimates by age group were comparable between sentinel surveys and PHDC from 2014 onwards with an average absolute prevalence difference of 1.6%. Conclusion This study is the first to compare sentinel sero-prevalence surveys with both register-based aggregate data and consolidated individuated administrative data. We show that in this setting linked individuated data may be reliably used for HIV surveillance and provide more granular estimates with greater efficiency.
first_indexed 2024-03-09T07:42:36Z
format Article
id doaj.art-d1f800d05c714bca9528f1e97869a77e
institution Directory Open Access Journal
issn 2399-4908
language English
last_indexed 2024-03-09T07:42:36Z
publishDate 2022-08-01
publisher Swansea University
record_format Article
series International Journal of Population Data Science
spelling doaj.art-d1f800d05c714bca9528f1e97869a77e2023-12-03T04:22:57ZengSwansea UniversityInternational Journal of Population Data Science2399-49082022-08-017310.23889/ijpds.v7i3.1840The validity of routine individuated programmatic data in HIV surveillance assessed over a 10-year period.Nisha Jacob0Brian Rice1Andrew Boulle2University of Cape TownLondon School of Hygiene and Tropical MedicineUniversity of Cape Town Objectives In the Western Cape, South Africa, public-sector individual-level routine data are consolidated from multiple non-EMR sources through the Provincial Health Data Centre (PHDC).  This enables the description of temporal changes in population-wide antenatal HIV sero-prevalence. We evaluated the validity of these data compared to aggregated program data and sentinel surveys. Approach We conducted a retrospective cohort analysis of all pregnancies consolidated in the PHDC from January 2011 to December 2020. Evidence of antenatal and HIV care from electronic platforms (each with its own multi-source phenotype algorithm) were linked using a unique patient identifier. HIV prevalence estimates were triangulated with available sentinel national and provincial antenatal survey estimates as well as aggregated programmatic data from registers as recorded in the District Health Information System. Provincial, district-level and age-group HIV prevalence estimates were compared between data systems using correlation coefficients, absolute differences and trend analysis. Results In total, 982,914 pregnancies were ascertained with a median maternal age of 26.9 years. Between 2011–2013, PHDC HIV prevalence estimates were widely disparate from aggregate and survey data (due to incomplete electronic data) whereas, from 2014 onwards, estimates were more closely correlated to aggregate data estimates (r=0.83; p=0.02) with an average absolute prevalence difference of 0.97%. In keeping with survey and aggregate data trends, PHDC data show a relatively stable provincial HIV prevalence from 17.0 (95%CI 16.8%–17.2%) in 2015 to 18.9% (95%CI 18.7–19.1%) in 2020. The highest HIV prevalence was in the Cape Metro district (20.6%; 95%CI 20.4%–20.9%). Prevalence estimates by age group were comparable between sentinel surveys and PHDC from 2014 onwards with an average absolute prevalence difference of 1.6%. Conclusion This study is the first to compare sentinel sero-prevalence surveys with both register-based aggregate data and consolidated individuated administrative data. We show that in this setting linked individuated data may be reliably used for HIV surveillance and provide more granular estimates with greater efficiency. https://ijpds.org/article/view/1840HIV surveillanceroutine datadata linkagesentinel surveysHIV sero-prevalence
spellingShingle Nisha Jacob
Brian Rice
Andrew Boulle
The validity of routine individuated programmatic data in HIV surveillance assessed over a 10-year period.
International Journal of Population Data Science
HIV surveillance
routine data
data linkage
sentinel surveys
HIV sero-prevalence
title The validity of routine individuated programmatic data in HIV surveillance assessed over a 10-year period.
title_full The validity of routine individuated programmatic data in HIV surveillance assessed over a 10-year period.
title_fullStr The validity of routine individuated programmatic data in HIV surveillance assessed over a 10-year period.
title_full_unstemmed The validity of routine individuated programmatic data in HIV surveillance assessed over a 10-year period.
title_short The validity of routine individuated programmatic data in HIV surveillance assessed over a 10-year period.
title_sort validity of routine individuated programmatic data in hiv surveillance assessed over a 10 year period
topic HIV surveillance
routine data
data linkage
sentinel surveys
HIV sero-prevalence
url https://ijpds.org/article/view/1840
work_keys_str_mv AT nishajacob thevalidityofroutineindividuatedprogrammaticdatainhivsurveillanceassessedovera10yearperiod
AT brianrice thevalidityofroutineindividuatedprogrammaticdatainhivsurveillanceassessedovera10yearperiod
AT andrewboulle thevalidityofroutineindividuatedprogrammaticdatainhivsurveillanceassessedovera10yearperiod
AT nishajacob validityofroutineindividuatedprogrammaticdatainhivsurveillanceassessedovera10yearperiod
AT brianrice validityofroutineindividuatedprogrammaticdatainhivsurveillanceassessedovera10yearperiod
AT andrewboulle validityofroutineindividuatedprogrammaticdatainhivsurveillanceassessedovera10yearperiod