Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance

Diabetes is a life-threatening, non-communicable disease. Diabetes mellitus is a prevalent chronic disease with a significant global impact. The timely detection of diabetes in patients is necessary for an effective treatment. The primary objective of this study is to propose a novel approach for id...

Full description

Bibliographic Details
Main Authors: Dinesh Chellappan, Harikumar Rajaguru
Format: Article
Language:English
Published: MDPI AG 2023-08-01
Series:Diagnostics
Subjects:
Online Access:https://www.mdpi.com/2075-4418/13/16/2654
_version_ 1797585028079681536
author Dinesh Chellappan
Harikumar Rajaguru
author_facet Dinesh Chellappan
Harikumar Rajaguru
author_sort Dinesh Chellappan
collection DOAJ
description Diabetes is a life-threatening, non-communicable disease. Diabetes mellitus is a prevalent chronic disease with a significant global impact. The timely detection of diabetes in patients is necessary for an effective treatment. The primary objective of this study is to propose a novel approach for identifying type II diabetes mellitus using microarray gene data. Specifically, our research focuses on the performance enhancement of methods for detecting diabetes. Four different Dimensionality Reduction techniques, Detrend Fluctuation Analysis (DFA), the Chi-square probability density function (Chi2pdf), the Firefly algorithm, and Cuckoo Search, are used to reduce high dimensional data. Metaheuristic algorithms like Particle Swarm Optimization (PSO) and Harmonic Search (HS) are used for feature selection. Seven classifiers, Non-Linear Regression (NLR), Linear Regression (LR), Logistics Regression (LoR), Gaussian Mixture Model (GMM), Bayesian Linear Discriminant Classifier (BLDC), Softmax Discriminant Classifier (SDC), and Support Vector Machine—Radial Basis Function (SVM-RBF), are utilized to classify the diabetic and non-diabetic classes. The classifiers’ performances are analyzed through parameters such as accuracy, recall, precision, F1 score, error rate, Matthews Correlation Coefficient (MCC), Jaccard metric, and kappa. The SVM (RBF) classifier with the Chi2pdf Dimensionality Reduction technique with a PSO feature selection method attained a high accuracy of 91% with a Kappa of 0.7961, outperforming all of the other classifiers.
first_indexed 2024-03-10T23:59:58Z
format Article
id doaj.art-23c68f9768f54ea08107a0d70b725956
institution Directory Open Access Journal
issn 2075-4418
language English
last_indexed 2024-03-10T23:59:58Z
publishDate 2023-08-01
publisher MDPI AG
record_format Article
series Diagnostics
spelling doaj.art-23c68f9768f54ea08107a0d70b7259562023-11-19T00:48:15ZengMDPI AGDiagnostics2075-44182023-08-011316265410.3390/diagnostics13162654Detection of Diabetes through Microarray Genes with Enhancement of Classifiers PerformanceDinesh Chellappan0Harikumar Rajaguru1Department of Electrical and Electronics Engineering, KPR Institute of Engineering and Technology, Coimbatore 641 407, Tamil Nadu, IndiaDepartment of Electronics and Communication Engineering, Bannari Amman Institute of Technology, Sathyamangalam 638 401, Tamil Nadu, IndiaDiabetes is a life-threatening, non-communicable disease. Diabetes mellitus is a prevalent chronic disease with a significant global impact. The timely detection of diabetes in patients is necessary for an effective treatment. The primary objective of this study is to propose a novel approach for identifying type II diabetes mellitus using microarray gene data. Specifically, our research focuses on the performance enhancement of methods for detecting diabetes. Four different Dimensionality Reduction techniques, Detrend Fluctuation Analysis (DFA), the Chi-square probability density function (Chi2pdf), the Firefly algorithm, and Cuckoo Search, are used to reduce high dimensional data. Metaheuristic algorithms like Particle Swarm Optimization (PSO) and Harmonic Search (HS) are used for feature selection. Seven classifiers, Non-Linear Regression (NLR), Linear Regression (LR), Logistics Regression (LoR), Gaussian Mixture Model (GMM), Bayesian Linear Discriminant Classifier (BLDC), Softmax Discriminant Classifier (SDC), and Support Vector Machine—Radial Basis Function (SVM-RBF), are utilized to classify the diabetic and non-diabetic classes. The classifiers’ performances are analyzed through parameters such as accuracy, recall, precision, F1 score, error rate, Matthews Correlation Coefficient (MCC), Jaccard metric, and kappa. The SVM (RBF) classifier with the Chi2pdf Dimensionality Reduction technique with a PSO feature selection method attained a high accuracy of 91% with a Kappa of 0.7961, outperforming all of the other classifiers.https://www.mdpi.com/2075-4418/13/16/2654type II diabetes mellitusmachine learningpredictionDimensionality Reductionclassifiers
spellingShingle Dinesh Chellappan
Harikumar Rajaguru
Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
Diagnostics
type II diabetes mellitus
machine learning
prediction
Dimensionality Reduction
classifiers
title Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title_full Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title_fullStr Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title_full_unstemmed Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title_short Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title_sort detection of diabetes through microarray genes with enhancement of classifiers performance
topic type II diabetes mellitus
machine learning
prediction
Dimensionality Reduction
classifiers
url https://www.mdpi.com/2075-4418/13/16/2654
work_keys_str_mv AT dineshchellappan detectionofdiabetesthroughmicroarraygeneswithenhancementofclassifiersperformance
AT harikumarrajaguru detectionofdiabetesthroughmicroarraygeneswithenhancementofclassifiersperformance