An explainable machine learning approach for Alzheimer’s disease classification

Abstract The early diagnosis of Alzheimer’s disease (AD) presents a significant challenge due to the subtle biomarker changes often overlooked. Machine learning (ML) models offer a promising tool for identifying individuals at risk of AD. However, current research tends to prioritize ML accuracy whi...

Full description

Bibliographic Details
Main Authors: Abbas Saad Alatrany, Wasiq Khan, Abir Hussain, Hoshang Kolivand, Dhiya Al-Jumeily
Format: Article
Language:English
Published: Nature Portfolio 2024-02-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-024-51985-w
_version_ 1827327615537512448
author Abbas Saad Alatrany
Wasiq Khan
Abir Hussain
Hoshang Kolivand
Dhiya Al-Jumeily
author_facet Abbas Saad Alatrany
Wasiq Khan
Abir Hussain
Hoshang Kolivand
Dhiya Al-Jumeily
author_sort Abbas Saad Alatrany
collection DOAJ
description Abstract The early diagnosis of Alzheimer’s disease (AD) presents a significant challenge due to the subtle biomarker changes often overlooked. Machine learning (ML) models offer a promising tool for identifying individuals at risk of AD. However, current research tends to prioritize ML accuracy while neglecting the crucial aspect of model explainability. The diverse nature of AD data and the limited dataset size introduce additional challenges, primarily related to high dimensionality. In this study, we leveraged a dataset obtained from the National Alzheimer’s Coordinating Center, comprising 169,408 records and 1024 features. After applying various steps to reduce the feature space. Notably, support vector machine (SVM) models trained on the selected features exhibited high performance when tested on an external dataset. SVM achieved a high F1 score of 98.9% for binary classification (distinguishing between NC and AD) and 90.7% for multiclass classification. Furthermore, SVM was able to predict AD progression over a 4-year period, with F1 scores reached 88% for binary task and 72.8% for multiclass task. To enhance model explainability, we employed two rule-extraction approaches: class rule mining and stable and interpretable rule set for classification model. These approaches generated human-understandable rules to assist domain experts in comprehending the key factors involved in AD development. We further validated these rules using SHAP and LIME models, underscoring the significance of factors such as MEMORY, JUDGMENT, COMMUN, and ORIENT in determining AD risk. Our experimental outcomes also shed light on the crucial role of the Clinical Dementia Rating tool in predicting AD.
first_indexed 2024-03-07T15:03:53Z
format Article
id doaj.art-fc7bcc3a3cfe4c4aac30ccf318e8d18c
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-03-07T15:03:53Z
publishDate 2024-02-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-fc7bcc3a3cfe4c4aac30ccf318e8d18c2024-03-05T19:01:34ZengNature PortfolioScientific Reports2045-23222024-02-0114111810.1038/s41598-024-51985-wAn explainable machine learning approach for Alzheimer’s disease classificationAbbas Saad Alatrany0Wasiq Khan1Abir Hussain2Hoshang Kolivand3Dhiya Al-Jumeily4School of Computer Science and Mathematics, Liverpool John Moores UniversitySchool of Computer Science and Mathematics, Liverpool John Moores UniversitySchool of Computer Science and Mathematics, Liverpool John Moores UniversitySchool of Computer Science and Mathematics, Liverpool John Moores UniversitySchool of Computer Science and Mathematics, Liverpool John Moores UniversityAbstract The early diagnosis of Alzheimer’s disease (AD) presents a significant challenge due to the subtle biomarker changes often overlooked. Machine learning (ML) models offer a promising tool for identifying individuals at risk of AD. However, current research tends to prioritize ML accuracy while neglecting the crucial aspect of model explainability. The diverse nature of AD data and the limited dataset size introduce additional challenges, primarily related to high dimensionality. In this study, we leveraged a dataset obtained from the National Alzheimer’s Coordinating Center, comprising 169,408 records and 1024 features. After applying various steps to reduce the feature space. Notably, support vector machine (SVM) models trained on the selected features exhibited high performance when tested on an external dataset. SVM achieved a high F1 score of 98.9% for binary classification (distinguishing between NC and AD) and 90.7% for multiclass classification. Furthermore, SVM was able to predict AD progression over a 4-year period, with F1 scores reached 88% for binary task and 72.8% for multiclass task. To enhance model explainability, we employed two rule-extraction approaches: class rule mining and stable and interpretable rule set for classification model. These approaches generated human-understandable rules to assist domain experts in comprehending the key factors involved in AD development. We further validated these rules using SHAP and LIME models, underscoring the significance of factors such as MEMORY, JUDGMENT, COMMUN, and ORIENT in determining AD risk. Our experimental outcomes also shed light on the crucial role of the Clinical Dementia Rating tool in predicting AD.https://doi.org/10.1038/s41598-024-51985-w
spellingShingle Abbas Saad Alatrany
Wasiq Khan
Abir Hussain
Hoshang Kolivand
Dhiya Al-Jumeily
An explainable machine learning approach for Alzheimer’s disease classification
Scientific Reports
title An explainable machine learning approach for Alzheimer’s disease classification
title_full An explainable machine learning approach for Alzheimer’s disease classification
title_fullStr An explainable machine learning approach for Alzheimer’s disease classification
title_full_unstemmed An explainable machine learning approach for Alzheimer’s disease classification
title_short An explainable machine learning approach for Alzheimer’s disease classification
title_sort explainable machine learning approach for alzheimer s disease classification
url https://doi.org/10.1038/s41598-024-51985-w
work_keys_str_mv AT abbassaadalatrany anexplainablemachinelearningapproachforalzheimersdiseaseclassification
AT wasiqkhan anexplainablemachinelearningapproachforalzheimersdiseaseclassification
AT abirhussain anexplainablemachinelearningapproachforalzheimersdiseaseclassification
AT hoshangkolivand anexplainablemachinelearningapproachforalzheimersdiseaseclassification
AT dhiyaaljumeily anexplainablemachinelearningapproachforalzheimersdiseaseclassification
AT abbassaadalatrany explainablemachinelearningapproachforalzheimersdiseaseclassification
AT wasiqkhan explainablemachinelearningapproachforalzheimersdiseaseclassification
AT abirhussain explainablemachinelearningapproachforalzheimersdiseaseclassification
AT hoshangkolivand explainablemachinelearningapproachforalzheimersdiseaseclassification
AT dhiyaaljumeily explainablemachinelearningapproachforalzheimersdiseaseclassification