Early detection of Parkinson’s disease through enriching the electronic health record using a biomedical knowledge graph

IntroductionEarly diagnosis of Parkinson’s disease (PD) is important to identify treatments to slow neurodegeneration. People who develop PD often have symptoms before the disease manifests and may be coded as diagnoses in the electronic health record (EHR).MethodsTo predict PD diagnosis, we embedde...

Full description

Bibliographic Details
Main Authors: Karthik Soman, Charlotte A. Nelson, Gabriel Cerono, Samuel M. Goldman, Sergio E. Baranzini, Ethan G. Brown
Format: Article
Language:English
Published: Frontiers Media S.A. 2023-05-01
Series:Frontiers in Medicine
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmed.2023.1081087/full
_version_ 1797829010229559296
author Karthik Soman
Charlotte A. Nelson
Gabriel Cerono
Samuel M. Goldman
Sergio E. Baranzini
Ethan G. Brown
author_facet Karthik Soman
Charlotte A. Nelson
Gabriel Cerono
Samuel M. Goldman
Sergio E. Baranzini
Ethan G. Brown
author_sort Karthik Soman
collection DOAJ
description IntroductionEarly diagnosis of Parkinson’s disease (PD) is important to identify treatments to slow neurodegeneration. People who develop PD often have symptoms before the disease manifests and may be coded as diagnoses in the electronic health record (EHR).MethodsTo predict PD diagnosis, we embedded EHR data of patients onto a biomedical knowledge graph called Scalable Precision medicine Open Knowledge Engine (SPOKE) and created patient embedding vectors. We trained and validated a classifier using these vectors from 3,004 PD patients, restricting records to 1, 3, and 5 years before diagnosis, and 457,197 non-PD group.ResultsThe classifier predicted PD diagnosis with moderate accuracy (AUC = 0.77 ± 0.06, 0.74 ± 0.05, 0.72 ± 0.05 at 1, 3, and 5 years) and performed better than other benchmark methods. Nodes in the SPOKE graph, among cases, revealed novel associations, while SPOKE patient vectors revealed the basis for individual risk classification.DiscussionThe proposed method was able to explain the clinical predictions using the knowledge graph, thereby making the predictions clinically interpretable. Through enriching EHR data with biomedical associations, SPOKE may be a cost-efficient and personalized way to predict PD diagnosis years before its occurrence.
first_indexed 2024-04-09T13:13:32Z
format Article
id doaj.art-814bed3af30e4b41b23cf6ab26fbd7ec
institution Directory Open Access Journal
issn 2296-858X
language English
last_indexed 2024-04-09T13:13:32Z
publishDate 2023-05-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Medicine
spelling doaj.art-814bed3af30e4b41b23cf6ab26fbd7ec2023-05-12T05:40:39ZengFrontiers Media S.A.Frontiers in Medicine2296-858X2023-05-011010.3389/fmed.2023.10810871081087Early detection of Parkinson’s disease through enriching the electronic health record using a biomedical knowledge graphKarthik Soman0Charlotte A. Nelson1Gabriel Cerono2Samuel M. Goldman3Sergio E. Baranzini4Ethan G. Brown5Department of Neurology, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, United StatesDepartment of Neurology, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, United StatesDepartment of Neurology, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, United StatesDivision of Occupational and Environmental Medicine, University of California, San Francisco, San Francisco, CA, United StatesDepartment of Neurology, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, United StatesDepartment of Neurology, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, United StatesIntroductionEarly diagnosis of Parkinson’s disease (PD) is important to identify treatments to slow neurodegeneration. People who develop PD often have symptoms before the disease manifests and may be coded as diagnoses in the electronic health record (EHR).MethodsTo predict PD diagnosis, we embedded EHR data of patients onto a biomedical knowledge graph called Scalable Precision medicine Open Knowledge Engine (SPOKE) and created patient embedding vectors. We trained and validated a classifier using these vectors from 3,004 PD patients, restricting records to 1, 3, and 5 years before diagnosis, and 457,197 non-PD group.ResultsThe classifier predicted PD diagnosis with moderate accuracy (AUC = 0.77 ± 0.06, 0.74 ± 0.05, 0.72 ± 0.05 at 1, 3, and 5 years) and performed better than other benchmark methods. Nodes in the SPOKE graph, among cases, revealed novel associations, while SPOKE patient vectors revealed the basis for individual risk classification.DiscussionThe proposed method was able to explain the clinical predictions using the knowledge graph, thereby making the predictions clinically interpretable. Through enriching EHR data with biomedical associations, SPOKE may be a cost-efficient and personalized way to predict PD diagnosis years before its occurrence.https://www.frontiersin.org/articles/10.3389/fmed.2023.1081087/fullParkinson diseaseneurodegenerative disorderelectronic health recordknowledge graphgraph algorithmmachine learning
spellingShingle Karthik Soman
Charlotte A. Nelson
Gabriel Cerono
Samuel M. Goldman
Sergio E. Baranzini
Ethan G. Brown
Early detection of Parkinson’s disease through enriching the electronic health record using a biomedical knowledge graph
Frontiers in Medicine
Parkinson disease
neurodegenerative disorder
electronic health record
knowledge graph
graph algorithm
machine learning
title Early detection of Parkinson’s disease through enriching the electronic health record using a biomedical knowledge graph
title_full Early detection of Parkinson’s disease through enriching the electronic health record using a biomedical knowledge graph
title_fullStr Early detection of Parkinson’s disease through enriching the electronic health record using a biomedical knowledge graph
title_full_unstemmed Early detection of Parkinson’s disease through enriching the electronic health record using a biomedical knowledge graph
title_short Early detection of Parkinson’s disease through enriching the electronic health record using a biomedical knowledge graph
title_sort early detection of parkinson s disease through enriching the electronic health record using a biomedical knowledge graph
topic Parkinson disease
neurodegenerative disorder
electronic health record
knowledge graph
graph algorithm
machine learning
url https://www.frontiersin.org/articles/10.3389/fmed.2023.1081087/full
work_keys_str_mv AT karthiksoman earlydetectionofparkinsonsdiseasethroughenrichingtheelectronichealthrecordusingabiomedicalknowledgegraph
AT charlotteanelson earlydetectionofparkinsonsdiseasethroughenrichingtheelectronichealthrecordusingabiomedicalknowledgegraph
AT gabrielcerono earlydetectionofparkinsonsdiseasethroughenrichingtheelectronichealthrecordusingabiomedicalknowledgegraph
AT samuelmgoldman earlydetectionofparkinsonsdiseasethroughenrichingtheelectronichealthrecordusingabiomedicalknowledgegraph
AT sergioebaranzini earlydetectionofparkinsonsdiseasethroughenrichingtheelectronichealthrecordusingabiomedicalknowledgegraph
AT ethangbrown earlydetectionofparkinsonsdiseasethroughenrichingtheelectronichealthrecordusingabiomedicalknowledgegraph