Cohort design and natural language processing to reduce bias in electronic health records research
Abstract Electronic health record (EHR) datasets are statistically powerful but are subject to ascertainment bias and missingness. Using the Mass General Brigham multi-institutional EHR, we approximated a community-based cohort by sampling patients receiving longitudinal primary care between 2001-20...
Main Authors: | Shaan Khurshid, Christopher Reeder, Lia X. Harrington, Pulkit Singh, Gopal Sarma, Samuel F. Friedman, Paolo Di Achille, Nathaniel Diamant, Jonathan W. Cunningham, Ashby C. Turner, Emily S. Lau, Julian S. Haimovich, Mostafa A. Al-Alusi, Xin Wang, Marcus D. R. Klarqvist, Jeffrey M. Ashburner, Christian Diedrich, Mercedeh Ghadessi, Johanna Mielke, Hanna M. Eilken, Alice McElhinney, Andrea Derix, Steven J. Atlas, Patrick T. Ellinor, Anthony A. Philippakis, Christopher D. Anderson, Jennifer E. Ho, Puneet Batra, Steven A. Lubitz |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2022-04-01
|
Series: | npj Digital Medicine |
Online Access: | https://doi.org/10.1038/s41746-022-00590-0 |
Similar Items
-
Characteristics and Attitudes of Wearable Device Users and Nonusers in a Large Health Care System
by: Rachael A. Venn, et al.
Published: (2024-01-01) -
Unsupervised deep learning of electrocardiograms enables scalable human disease profiling
by: Sam F. Friedman, et al.
Published: (2025-01-01) -
Comparative Clinical Effectiveness of Population‐Based Atrial Fibrillation Screening Using Contemporary Modalities: A Decision‐Analytic Model
by: Shaan Khurshid, et al.
Published: (2021-09-01) -
Physiology as a Lingua Franca for Clinical Machine Learning
by: Gopal P. Sarma, et al.
Published: (2020-05-01) -
A deep learning digital biomarker to detect hypertension and stratify cardiovascular risk from the electrocardiogram
by: Mostafa A. Al-Alusi, et al.
Published: (2025-02-01)