Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm

Pregnancy is a condition of broad interest across many medical and health services research domains, but one not easily identified in healthcare claims data. Our objective was to establish an algorithm to identify pregnant women and their pregnancies in claims data. We identified pregnancy-related d...

Full description

Bibliographic Details
Main Authors: Elizabeth C. Ailes, Weiming Zhu, Elizabeth A. Clark, Ya-lin A. Huang, Margaret A. Lampe, Athena P. Kourtis, Jennita Reefhuis, Karen W. Hoover
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2023-01-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10124843/?tool=EBI
_version_ 1797839095715594240
author Elizabeth C. Ailes
Weiming Zhu
Elizabeth A. Clark
Ya-lin A. Huang
Margaret A. Lampe
Athena P. Kourtis
Jennita Reefhuis
Karen W. Hoover
author_facet Elizabeth C. Ailes
Weiming Zhu
Elizabeth A. Clark
Ya-lin A. Huang
Margaret A. Lampe
Athena P. Kourtis
Jennita Reefhuis
Karen W. Hoover
author_sort Elizabeth C. Ailes
collection DOAJ
description Pregnancy is a condition of broad interest across many medical and health services research domains, but one not easily identified in healthcare claims data. Our objective was to establish an algorithm to identify pregnant women and their pregnancies in claims data. We identified pregnancy-related diagnosis, procedure, and diagnosis-related group codes, accounting for the transition to International Statistical Classification of Diseases, 10th Revision, Clinical Modification (ICD-10-CM) diagnosis and procedure codes, in health encounter reporting on 10/1/2015. We selected women in Merative MarketScan commercial databases aged 15–49 years with pregnancy-related claims, and their infants, during 2008–2019. Pregnancies, pregnancy outcomes, and gestational ages were assigned using the constellation of service dates, code types, pregnancy outcomes, and linkage to infant records. We describe pregnancy outcomes and gestational ages, as well as maternal age, census region, and health plan type. In a sensitivity analysis, we compared our algorithm-assigned date of last menstrual period (LMP) to fertility procedure-based LMP (date of procedure + 14 days) among women with embryo transfer or insemination procedures. Among 5,812,699 identified pregnancies, most (77.9%) were livebirths, followed by spontaneous abortions (16.2%); 3,274,353 (72.2%) livebirths could be linked to infants. Most pregnancies were among women 25–34 years (59.1%), living in the South (39.1%) and Midwest (22.4%), with large employer-sponsored insurance (52.0%). Outcome distributions were similar across ICD-9 and ICD-10 eras, with some variation in gestational age distribution observed. Sensitivity analyses supported our algorithm’s framework; algorithm- and fertility procedure-derived LMP estimates were within a week of each other (mean difference: -4 days [IQR: -13 to 6 days]; n = 107,870). We have developed an algorithm to identify pregnancies, their gestational age, and outcomes, across ICD-9 and ICD-10 eras using administrative data. This algorithm may be useful to reproductive health researchers investigating a broad range of pregnancy and infant outcomes.
first_indexed 2024-04-09T15:52:42Z
format Article
id doaj.art-22e32023aaf64fc3996d21e420e8be68
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-09T15:52:42Z
publishDate 2023-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-22e32023aaf64fc3996d21e420e8be682023-04-26T05:32:04ZengPublic Library of Science (PLoS)PLoS ONE1932-62032023-01-01184Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithmElizabeth C. AilesWeiming ZhuElizabeth A. ClarkYa-lin A. HuangMargaret A. LampeAthena P. KourtisJennita ReefhuisKaren W. HooverPregnancy is a condition of broad interest across many medical and health services research domains, but one not easily identified in healthcare claims data. Our objective was to establish an algorithm to identify pregnant women and their pregnancies in claims data. We identified pregnancy-related diagnosis, procedure, and diagnosis-related group codes, accounting for the transition to International Statistical Classification of Diseases, 10th Revision, Clinical Modification (ICD-10-CM) diagnosis and procedure codes, in health encounter reporting on 10/1/2015. We selected women in Merative MarketScan commercial databases aged 15–49 years with pregnancy-related claims, and their infants, during 2008–2019. Pregnancies, pregnancy outcomes, and gestational ages were assigned using the constellation of service dates, code types, pregnancy outcomes, and linkage to infant records. We describe pregnancy outcomes and gestational ages, as well as maternal age, census region, and health plan type. In a sensitivity analysis, we compared our algorithm-assigned date of last menstrual period (LMP) to fertility procedure-based LMP (date of procedure + 14 days) among women with embryo transfer or insemination procedures. Among 5,812,699 identified pregnancies, most (77.9%) were livebirths, followed by spontaneous abortions (16.2%); 3,274,353 (72.2%) livebirths could be linked to infants. Most pregnancies were among women 25–34 years (59.1%), living in the South (39.1%) and Midwest (22.4%), with large employer-sponsored insurance (52.0%). Outcome distributions were similar across ICD-9 and ICD-10 eras, with some variation in gestational age distribution observed. Sensitivity analyses supported our algorithm’s framework; algorithm- and fertility procedure-derived LMP estimates were within a week of each other (mean difference: -4 days [IQR: -13 to 6 days]; n = 107,870). We have developed an algorithm to identify pregnancies, their gestational age, and outcomes, across ICD-9 and ICD-10 eras using administrative data. This algorithm may be useful to reproductive health researchers investigating a broad range of pregnancy and infant outcomes.https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10124843/?tool=EBI
spellingShingle Elizabeth C. Ailes
Weiming Zhu
Elizabeth A. Clark
Ya-lin A. Huang
Margaret A. Lampe
Athena P. Kourtis
Jennita Reefhuis
Karen W. Hoover
Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm
PLoS ONE
title Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm
title_full Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm
title_fullStr Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm
title_full_unstemmed Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm
title_short Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm
title_sort identification of pregnancies and their outcomes in healthcare claims data 2008 2019 an algorithm
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10124843/?tool=EBI
work_keys_str_mv AT elizabethcailes identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm
AT weimingzhu identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm
AT elizabethaclark identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm
AT yalinahuang identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm
AT margaretalampe identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm
AT athenapkourtis identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm
AT jennitareefhuis identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm
AT karenwhoover identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm