Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm
Pregnancy is a condition of broad interest across many medical and health services research domains, but one not easily identified in healthcare claims data. Our objective was to establish an algorithm to identify pregnant women and their pregnancies in claims data. We identified pregnancy-related d...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2023-01-01
|
Series: | PLoS ONE |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10124843/?tool=EBI |
_version_ | 1797839095715594240 |
---|---|
author | Elizabeth C. Ailes Weiming Zhu Elizabeth A. Clark Ya-lin A. Huang Margaret A. Lampe Athena P. Kourtis Jennita Reefhuis Karen W. Hoover |
author_facet | Elizabeth C. Ailes Weiming Zhu Elizabeth A. Clark Ya-lin A. Huang Margaret A. Lampe Athena P. Kourtis Jennita Reefhuis Karen W. Hoover |
author_sort | Elizabeth C. Ailes |
collection | DOAJ |
description | Pregnancy is a condition of broad interest across many medical and health services research domains, but one not easily identified in healthcare claims data. Our objective was to establish an algorithm to identify pregnant women and their pregnancies in claims data. We identified pregnancy-related diagnosis, procedure, and diagnosis-related group codes, accounting for the transition to International Statistical Classification of Diseases, 10th Revision, Clinical Modification (ICD-10-CM) diagnosis and procedure codes, in health encounter reporting on 10/1/2015. We selected women in Merative MarketScan commercial databases aged 15–49 years with pregnancy-related claims, and their infants, during 2008–2019. Pregnancies, pregnancy outcomes, and gestational ages were assigned using the constellation of service dates, code types, pregnancy outcomes, and linkage to infant records. We describe pregnancy outcomes and gestational ages, as well as maternal age, census region, and health plan type. In a sensitivity analysis, we compared our algorithm-assigned date of last menstrual period (LMP) to fertility procedure-based LMP (date of procedure + 14 days) among women with embryo transfer or insemination procedures. Among 5,812,699 identified pregnancies, most (77.9%) were livebirths, followed by spontaneous abortions (16.2%); 3,274,353 (72.2%) livebirths could be linked to infants. Most pregnancies were among women 25–34 years (59.1%), living in the South (39.1%) and Midwest (22.4%), with large employer-sponsored insurance (52.0%). Outcome distributions were similar across ICD-9 and ICD-10 eras, with some variation in gestational age distribution observed. Sensitivity analyses supported our algorithm’s framework; algorithm- and fertility procedure-derived LMP estimates were within a week of each other (mean difference: -4 days [IQR: -13 to 6 days]; n = 107,870). We have developed an algorithm to identify pregnancies, their gestational age, and outcomes, across ICD-9 and ICD-10 eras using administrative data. This algorithm may be useful to reproductive health researchers investigating a broad range of pregnancy and infant outcomes. |
first_indexed | 2024-04-09T15:52:42Z |
format | Article |
id | doaj.art-22e32023aaf64fc3996d21e420e8be68 |
institution | Directory Open Access Journal |
issn | 1932-6203 |
language | English |
last_indexed | 2024-04-09T15:52:42Z |
publishDate | 2023-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj.art-22e32023aaf64fc3996d21e420e8be682023-04-26T05:32:04ZengPublic Library of Science (PLoS)PLoS ONE1932-62032023-01-01184Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithmElizabeth C. AilesWeiming ZhuElizabeth A. ClarkYa-lin A. HuangMargaret A. LampeAthena P. KourtisJennita ReefhuisKaren W. HooverPregnancy is a condition of broad interest across many medical and health services research domains, but one not easily identified in healthcare claims data. Our objective was to establish an algorithm to identify pregnant women and their pregnancies in claims data. We identified pregnancy-related diagnosis, procedure, and diagnosis-related group codes, accounting for the transition to International Statistical Classification of Diseases, 10th Revision, Clinical Modification (ICD-10-CM) diagnosis and procedure codes, in health encounter reporting on 10/1/2015. We selected women in Merative MarketScan commercial databases aged 15–49 years with pregnancy-related claims, and their infants, during 2008–2019. Pregnancies, pregnancy outcomes, and gestational ages were assigned using the constellation of service dates, code types, pregnancy outcomes, and linkage to infant records. We describe pregnancy outcomes and gestational ages, as well as maternal age, census region, and health plan type. In a sensitivity analysis, we compared our algorithm-assigned date of last menstrual period (LMP) to fertility procedure-based LMP (date of procedure + 14 days) among women with embryo transfer or insemination procedures. Among 5,812,699 identified pregnancies, most (77.9%) were livebirths, followed by spontaneous abortions (16.2%); 3,274,353 (72.2%) livebirths could be linked to infants. Most pregnancies were among women 25–34 years (59.1%), living in the South (39.1%) and Midwest (22.4%), with large employer-sponsored insurance (52.0%). Outcome distributions were similar across ICD-9 and ICD-10 eras, with some variation in gestational age distribution observed. Sensitivity analyses supported our algorithm’s framework; algorithm- and fertility procedure-derived LMP estimates were within a week of each other (mean difference: -4 days [IQR: -13 to 6 days]; n = 107,870). We have developed an algorithm to identify pregnancies, their gestational age, and outcomes, across ICD-9 and ICD-10 eras using administrative data. This algorithm may be useful to reproductive health researchers investigating a broad range of pregnancy and infant outcomes.https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10124843/?tool=EBI |
spellingShingle | Elizabeth C. Ailes Weiming Zhu Elizabeth A. Clark Ya-lin A. Huang Margaret A. Lampe Athena P. Kourtis Jennita Reefhuis Karen W. Hoover Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm PLoS ONE |
title | Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm |
title_full | Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm |
title_fullStr | Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm |
title_full_unstemmed | Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm |
title_short | Identification of pregnancies and their outcomes in healthcare claims data, 2008–2019: An algorithm |
title_sort | identification of pregnancies and their outcomes in healthcare claims data 2008 2019 an algorithm |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10124843/?tool=EBI |
work_keys_str_mv | AT elizabethcailes identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm AT weimingzhu identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm AT elizabethaclark identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm AT yalinahuang identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm AT margaretalampe identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm AT athenapkourtis identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm AT jennitareefhuis identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm AT karenwhoover identificationofpregnanciesandtheiroutcomesinhealthcareclaimsdata20082019analgorithm |