An Ensemble-Learning Based Application to Predict the Earlier Stages of Alzheimer’s Disease (AD)

The fact that ensemble methods enhance the prediction performance. Therefore, we focused on developing a weighted ensemble method using a novel combination of Cerebrospinal Fluid (CSF) protein biomarkers to predict AD's earlier stages with greater accuracy than the state-of-the-art CSF protein...

Full description

Bibliographic Details
Main Authors: Asif Hassan Syed, Tabrej Khan, Atif Hassan, Nashwan A. Alromema, Muhammad Binsawad, Alhuseen Omar Alsayed
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9288747/
_version_ 1818662533877202944
author Asif Hassan Syed
Tabrej Khan
Atif Hassan
Nashwan A. Alromema
Muhammad Binsawad
Alhuseen Omar Alsayed
author_facet Asif Hassan Syed
Tabrej Khan
Atif Hassan
Nashwan A. Alromema
Muhammad Binsawad
Alhuseen Omar Alsayed
author_sort Asif Hassan Syed
collection DOAJ
description The fact that ensemble methods enhance the prediction performance. Therefore, we focused on developing a weighted ensemble method using a novel combination of Cerebrospinal Fluid (CSF) protein biomarkers to predict AD's earlier stages with greater accuracy than the state-of-the-art CSF protein biomarkers. In this regard, two feature selection methods, namely the Recursive Feature Elimination (RFE) and L1 regularization method were used to screen the most important subset of features for building a classification model using the Mild Cognitive Impairment (MCI) dataset. A novel combination of three biomarkers, namely Cystatin C, Matrix metalloproteinases (MMP10), and tau protein, was screened using the linear Support Vector Machine (SVM) and Logistic Regression (LR) classifier based RFE method. Two-tailed unpaired t-test analysis at a 5% significance level showed a significant difference between the mean levels of Cystatin C, MMP10, and tau protein between cognitive normal and cognitively impaired groups. An ensemble model using a weighted average of two best performing classifiers (LR and Linear SVM) was created using a novel subset of three most informative features. Our ensemble model's weighted average results performed significantly better than LR and Linear SVM base classifiers' performance. The Receiver Operating Characteristic Curve (ROC_AUC) and Area under Precision-Recall values (AUPR) of our proposed model were observed to be 0.9799 ± 0.055 0.9108 ± 0.015, respectively. The performance of our proposed weighted averaged ensemble model built using a novel combination of CSF protein biomarkers was significantly better (p <; 0.001) than models generated using different combinations of CSF protein biomarkers obtained from recent studies. An ensemble-learning based application was implemented and deployed at Heroku at https://appsalzheimer.herokuapp.com.
first_indexed 2024-12-17T05:02:28Z
format Article
id doaj.art-e5b9997185a44136a718a0a2770c1113
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-17T05:02:28Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-e5b9997185a44136a718a0a2770c11132022-12-21T22:02:31ZengIEEEIEEE Access2169-35362020-01-01822212622214310.1109/ACCESS.2020.30437159288747An Ensemble-Learning Based Application to Predict the Earlier Stages of Alzheimer’s Disease (AD)Asif Hassan Syed0https://orcid.org/0000-0002-2699-9102Tabrej Khan1https://orcid.org/0000-0002-9510-8537Atif Hassan2Nashwan A. Alromema3https://orcid.org/0000-0001-6208-2863Muhammad Binsawad4https://orcid.org/0000-0003-0915-7058Alhuseen Omar Alsayed5https://orcid.org/0000-0003-3044-799XDepartment of Computer Science, Faculty of Computing and Information Technology in Rabigh (FCITR), King Abdulaziz University, Jeddah, Saudi ArabiaDepartment of Information Systems, Faculty of Computing and Information Technology in Rabigh (FCITR), King Abdulaziz University, Jeddah, Saudi ArabiaDepartment of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, IndiaDepartment of Computer Science, Faculty of Computing and Information Technology in Rabigh (FCITR), King Abdulaziz University, Jeddah, Saudi ArabiaDepartment of Computer Science, Faculty of Computing and Information Technology in Rabigh (FCITR), King Abdulaziz University, Jeddah, Saudi ArabiaScientific Researcher, Deanship of Scientific Research, King Abdulaziz University, Jeddah, Saudi ArabiaThe fact that ensemble methods enhance the prediction performance. Therefore, we focused on developing a weighted ensemble method using a novel combination of Cerebrospinal Fluid (CSF) protein biomarkers to predict AD's earlier stages with greater accuracy than the state-of-the-art CSF protein biomarkers. In this regard, two feature selection methods, namely the Recursive Feature Elimination (RFE) and L1 regularization method were used to screen the most important subset of features for building a classification model using the Mild Cognitive Impairment (MCI) dataset. A novel combination of three biomarkers, namely Cystatin C, Matrix metalloproteinases (MMP10), and tau protein, was screened using the linear Support Vector Machine (SVM) and Logistic Regression (LR) classifier based RFE method. Two-tailed unpaired t-test analysis at a 5% significance level showed a significant difference between the mean levels of Cystatin C, MMP10, and tau protein between cognitive normal and cognitively impaired groups. An ensemble model using a weighted average of two best performing classifiers (LR and Linear SVM) was created using a novel subset of three most informative features. Our ensemble model's weighted average results performed significantly better than LR and Linear SVM base classifiers' performance. The Receiver Operating Characteristic Curve (ROC_AUC) and Area under Precision-Recall values (AUPR) of our proposed model were observed to be 0.9799 ± 0.055 0.9108 ± 0.015, respectively. The performance of our proposed weighted averaged ensemble model built using a novel combination of CSF protein biomarkers was significantly better (p <; 0.001) than models generated using different combinations of CSF protein biomarkers obtained from recent studies. An ensemble-learning based application was implemented and deployed at Heroku at https://appsalzheimer.herokuapp.com.https://ieeexplore.ieee.org/document/9288747/Mild cognitive impairment (MCI)cerebrospinal fluid (CSF) protein biomarkersclassification modelonline prediction system
spellingShingle Asif Hassan Syed
Tabrej Khan
Atif Hassan
Nashwan A. Alromema
Muhammad Binsawad
Alhuseen Omar Alsayed
An Ensemble-Learning Based Application to Predict the Earlier Stages of Alzheimer’s Disease (AD)
IEEE Access
Mild cognitive impairment (MCI)
cerebrospinal fluid (CSF) protein biomarkers
classification model
online prediction system
title An Ensemble-Learning Based Application to Predict the Earlier Stages of Alzheimer’s Disease (AD)
title_full An Ensemble-Learning Based Application to Predict the Earlier Stages of Alzheimer’s Disease (AD)
title_fullStr An Ensemble-Learning Based Application to Predict the Earlier Stages of Alzheimer’s Disease (AD)
title_full_unstemmed An Ensemble-Learning Based Application to Predict the Earlier Stages of Alzheimer’s Disease (AD)
title_short An Ensemble-Learning Based Application to Predict the Earlier Stages of Alzheimer’s Disease (AD)
title_sort ensemble learning based application to predict the earlier stages of alzheimer x2019 s disease ad
topic Mild cognitive impairment (MCI)
cerebrospinal fluid (CSF) protein biomarkers
classification model
online prediction system
url https://ieeexplore.ieee.org/document/9288747/
work_keys_str_mv AT asifhassansyed anensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT tabrejkhan anensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT atifhassan anensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT nashwanaalromema anensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT muhammadbinsawad anensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT alhuseenomaralsayed anensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT asifhassansyed ensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT tabrejkhan ensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT atifhassan ensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT nashwanaalromema ensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT muhammadbinsawad ensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead
AT alhuseenomaralsayed ensemblelearningbasedapplicationtopredicttheearlierstagesofalzheimerx2019sdiseasead