Probabilistic record linkage of de-identified research datasets with discrepancies using diagnosis codes
We develop an algorithm for probabilistic linkage of de-identified research datasets at the patient level, when only diagnosis codes with discrepancies and no personal health identifiers such as name or date of birth are available. It relies on Bayesian modelling of binarized diagnosis codes, and pr...
Main Authors: | Hejblum, Boris P., Weber, Griffin M., Liao, Katherine P., Palmer, Nathan P., Churchill, Susanne, Shadick, Nancy A., Szolovits, Peter, Murphy, Shawn N., Kohane, Isaac S., Cai, Tianxi |
---|---|
Other Authors: | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory |
Format: | Article |
Language: | English |
Published: |
Springer Nature
2019
|
Online Access: | https://hdl.handle.net/1721.1/122815 |
Similar Items
-
Estimating parameters for probabilistic linkage of privacy-preserved datasets
by: Adrian P. Brown, et al.
Published: (2017-07-01) -
Electronic Medical Records for Discovery Research in Rheumatoid Arthritis
by: Szolovits, Peter, et al.
Published: (2012) -
Serum Inflammatory Markers and Risk of Colorectal Cancer in Patients With Inflammatory Bowel Diseases
by: Ananthakrishnan, Ashwin N., et al.
Published: (2016) -
Mortality and extraintestinal cancers in patients with primary sclerosing cholangitis and inflammatory bowel disease
by: Ananthakrishnan, Ashwin N., et al.
Published: (2016) -
Association Between Reduced Plasma 25-Hydroxy Vitamin D and Increased Risk of Cancer in Patients With Inflammatory Bowel Diseases
by: Ananthakrishnan, Ashwin N., et al.
Published: (2016)