Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score.

Current stroke risk assessment tools presume the impact of risk factors is linear and cumulative. However, both novel risk factors and their interplay influencing stroke incidence are difficult to reveal using traditional additive models. The goal of this study was to improve upon the established Re...

Full description

Bibliographic Details
Main Authors: Agni Orfanoudaki, Emma Chesley, Christian Cadisch, Barry Stein, Amre Nouh, Mark J Alberts, Dimitris Bertsimas
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0232414
_version_ 1818734226313314304
author Agni Orfanoudaki
Emma Chesley
Christian Cadisch
Barry Stein
Amre Nouh
Mark J Alberts
Dimitris Bertsimas
author_facet Agni Orfanoudaki
Emma Chesley
Christian Cadisch
Barry Stein
Amre Nouh
Mark J Alberts
Dimitris Bertsimas
author_sort Agni Orfanoudaki
collection DOAJ
description Current stroke risk assessment tools presume the impact of risk factors is linear and cumulative. However, both novel risk factors and their interplay influencing stroke incidence are difficult to reveal using traditional additive models. The goal of this study was to improve upon the established Revised Framingham Stroke Risk Score and design an interactive Non-Linear Stroke Risk Score. Leveraging machine learning algorithms, our work aimed at increasing the accuracy of event prediction and uncovering new relationships in an interpretable fashion. A two-phase approach was used to create our stroke risk prediction score. First, clinical examinations of the Framingham offspring cohort were utilized as the training dataset for the predictive model. Optimal Classification Trees were used to develop a tree-based model to predict 10-year risk of stroke. Unlike classical methods, this algorithm adaptively changes the splits on the independent variables, introducing non-linear interactions among them. Second, the model was validated with a multi-ethnicity cohort from the Boston Medical Center. Our stroke risk score suggests a key dichotomy between patients with history of cardiovascular disease and the rest of the population. While it agrees with known findings, it also identified 23 unique stroke risk profiles and highlighted new non-linear relationships; such as the role of T-wave abnormality on electrocardiography and hematocrit levels in a patient's risk profile. Our results suggested that the non-linear approach significantly improves upon the baseline in the c-statistic (training 87.43% (CI 0.85-0.90) vs. 73.74% (CI 0.70-0.76); validation 75.29% (CI 0.74-0.76) vs 65.93% (CI 0.64-0.67), even in multi-ethnicity populations. The clinical implications of the new risk score include prioritization of risk factor modification and personalized care at the patient level with improved targeting of interventions for stroke prevention.
first_indexed 2024-12-18T00:01:59Z
format Article
id doaj.art-55b9c39fd61b4b6cacb34739bf88e82a
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-18T00:01:59Z
publishDate 2020-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-55b9c39fd61b4b6cacb34739bf88e82a2022-12-21T21:27:55ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01155e023241410.1371/journal.pone.0232414Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score.Agni OrfanoudakiEmma ChesleyChristian CadischBarry SteinAmre NouhMark J AlbertsDimitris BertsimasCurrent stroke risk assessment tools presume the impact of risk factors is linear and cumulative. However, both novel risk factors and their interplay influencing stroke incidence are difficult to reveal using traditional additive models. The goal of this study was to improve upon the established Revised Framingham Stroke Risk Score and design an interactive Non-Linear Stroke Risk Score. Leveraging machine learning algorithms, our work aimed at increasing the accuracy of event prediction and uncovering new relationships in an interpretable fashion. A two-phase approach was used to create our stroke risk prediction score. First, clinical examinations of the Framingham offspring cohort were utilized as the training dataset for the predictive model. Optimal Classification Trees were used to develop a tree-based model to predict 10-year risk of stroke. Unlike classical methods, this algorithm adaptively changes the splits on the independent variables, introducing non-linear interactions among them. Second, the model was validated with a multi-ethnicity cohort from the Boston Medical Center. Our stroke risk score suggests a key dichotomy between patients with history of cardiovascular disease and the rest of the population. While it agrees with known findings, it also identified 23 unique stroke risk profiles and highlighted new non-linear relationships; such as the role of T-wave abnormality on electrocardiography and hematocrit levels in a patient's risk profile. Our results suggested that the non-linear approach significantly improves upon the baseline in the c-statistic (training 87.43% (CI 0.85-0.90) vs. 73.74% (CI 0.70-0.76); validation 75.29% (CI 0.74-0.76) vs 65.93% (CI 0.64-0.67), even in multi-ethnicity populations. The clinical implications of the new risk score include prioritization of risk factor modification and personalized care at the patient level with improved targeting of interventions for stroke prevention.https://doi.org/10.1371/journal.pone.0232414
spellingShingle Agni Orfanoudaki
Emma Chesley
Christian Cadisch
Barry Stein
Amre Nouh
Mark J Alberts
Dimitris Bertsimas
Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score.
PLoS ONE
title Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score.
title_full Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score.
title_fullStr Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score.
title_full_unstemmed Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score.
title_short Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score.
title_sort machine learning provides evidence that stroke risk is not linear the non linear framingham stroke risk score
url https://doi.org/10.1371/journal.pone.0232414
work_keys_str_mv AT agniorfanoudaki machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT emmachesley machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT christiancadisch machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT barrystein machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT amrenouh machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT markjalberts machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT dimitrisbertsimas machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore