Pixel-Level Fusion Approach with Vision Transformer for Early Detection of Alzheimer’s Disease

Alzheimer’s disease (AD) has become a serious hazard to human health in recent years, and proper screening and diagnosis of AD remain a challenge. Multimodal neuroimaging input can help identify AD in the early mild cognitive impairment (EMCI) and late mild cognitive impairment (LMCI) stages from no...

Full description

Bibliographic Details
Main Authors: Modupe Odusami, Rytis Maskeliūnas, Robertas Damaševičius
Format: Article
Language:English
Published: MDPI AG 2023-03-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/12/5/1218
_version_ 1797615521374404608
author Modupe Odusami
Rytis Maskeliūnas
Robertas Damaševičius
author_facet Modupe Odusami
Rytis Maskeliūnas
Robertas Damaševičius
author_sort Modupe Odusami
collection DOAJ
description Alzheimer’s disease (AD) has become a serious hazard to human health in recent years, and proper screening and diagnosis of AD remain a challenge. Multimodal neuroimaging input can help identify AD in the early mild cognitive impairment (EMCI) and late mild cognitive impairment (LMCI) stages from normal cognitive development using magnetic resonance imaging (MRI) and positron emission tomography (PET). MRI provides useful information on brain structural abnormalities, while PET data provide the difference between physiological and pathological changes in brain anatomy. The precision of diagnosing AD can increase when these data are combined. However, they are heterogeneous and appropriate, and an adequate number of features are required for AD classification. This paper proposed a multimodal fusion-based approach that uses a mathematical technique called discrete wavelet transform (DWT) to analyse the data, and the optimisation of this technique is achieved through transfer learning using a pre-trained neural network called VGG16. The final fused image is reconstructed using inverse discrete wavelet transform (IDWT). The fused images are classified using a pre-trained vision transformer. The evaluation of the benchmark Alzheimer’s disease neuroimaging initiative (ADNI) dataset shows an accuracy of 81.25% for AD/EMCI and AD/LMCI in MRI test data, as well as 93.75% for AD/EMCI and AD/LMCI in PET test data. The proposed model performed better than existing studies when tested on PET data with an accuracy of 93.75%.
first_indexed 2024-03-11T07:26:34Z
format Article
id doaj.art-7c47188a6dca48838e76a1dfc7632fdc
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-11T07:26:34Z
publishDate 2023-03-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-7c47188a6dca48838e76a1dfc7632fdc2023-11-17T07:33:17ZengMDPI AGElectronics2079-92922023-03-01125121810.3390/electronics12051218Pixel-Level Fusion Approach with Vision Transformer for Early Detection of Alzheimer’s DiseaseModupe Odusami0Rytis Maskeliūnas1Robertas Damaševičius2Center of Excellence Forest 4.0, Faculty of Informatics, Kaunas University of Technology, 44249 Kaunas, LithuaniaCenter of Excellence Forest 4.0, Faculty of Informatics, Kaunas University of Technology, 44249 Kaunas, LithuaniaCenter of Excellence Forest 4.0, Faculty of Informatics, Kaunas University of Technology, 44249 Kaunas, LithuaniaAlzheimer’s disease (AD) has become a serious hazard to human health in recent years, and proper screening and diagnosis of AD remain a challenge. Multimodal neuroimaging input can help identify AD in the early mild cognitive impairment (EMCI) and late mild cognitive impairment (LMCI) stages from normal cognitive development using magnetic resonance imaging (MRI) and positron emission tomography (PET). MRI provides useful information on brain structural abnormalities, while PET data provide the difference between physiological and pathological changes in brain anatomy. The precision of diagnosing AD can increase when these data are combined. However, they are heterogeneous and appropriate, and an adequate number of features are required for AD classification. This paper proposed a multimodal fusion-based approach that uses a mathematical technique called discrete wavelet transform (DWT) to analyse the data, and the optimisation of this technique is achieved through transfer learning using a pre-trained neural network called VGG16. The final fused image is reconstructed using inverse discrete wavelet transform (IDWT). The fused images are classified using a pre-trained vision transformer. The evaluation of the benchmark Alzheimer’s disease neuroimaging initiative (ADNI) dataset shows an accuracy of 81.25% for AD/EMCI and AD/LMCI in MRI test data, as well as 93.75% for AD/EMCI and AD/LMCI in PET test data. The proposed model performed better than existing studies when tested on PET data with an accuracy of 93.75%.https://www.mdpi.com/2079-9292/12/5/1218Alzheimer’s diseaseMRIPETdata fusionvision transformer
spellingShingle Modupe Odusami
Rytis Maskeliūnas
Robertas Damaševičius
Pixel-Level Fusion Approach with Vision Transformer for Early Detection of Alzheimer’s Disease
Electronics
Alzheimer’s disease
MRI
PET
data fusion
vision transformer
title Pixel-Level Fusion Approach with Vision Transformer for Early Detection of Alzheimer’s Disease
title_full Pixel-Level Fusion Approach with Vision Transformer for Early Detection of Alzheimer’s Disease
title_fullStr Pixel-Level Fusion Approach with Vision Transformer for Early Detection of Alzheimer’s Disease
title_full_unstemmed Pixel-Level Fusion Approach with Vision Transformer for Early Detection of Alzheimer’s Disease
title_short Pixel-Level Fusion Approach with Vision Transformer for Early Detection of Alzheimer’s Disease
title_sort pixel level fusion approach with vision transformer for early detection of alzheimer s disease
topic Alzheimer’s disease
MRI
PET
data fusion
vision transformer
url https://www.mdpi.com/2079-9292/12/5/1218
work_keys_str_mv AT modupeodusami pixellevelfusionapproachwithvisiontransformerforearlydetectionofalzheimersdisease
AT rytismaskeliunas pixellevelfusionapproachwithvisiontransformerforearlydetectionofalzheimersdisease
AT robertasdamasevicius pixellevelfusionapproachwithvisiontransformerforearlydetectionofalzheimersdisease