RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus

Diabetes is a category of metabolic disease commonly known as a chronic illness. It causes the body to generate less insulin and raises blood sugar levels, leading to various issues and disrupting the functioning of organs, including the retinal, kidney and nerves. To prevent this, people with chron...

Full description

Bibliographic Details
Main Authors: A. Usha Ruby, J George Chellin Chandran, TJ Swasthika Jain, BN Chaithanya, Renuka Patil
Format: Article
Language:English
Published: AIMS Press 2023-05-01
Series:AIMS Public Health
Subjects:
Online Access:https://www.aimspress.com/article/doi/10.3934/publichealth.2023030?viewType=HTML
_version_ 1797804689464492032
author A. Usha Ruby
J George Chellin Chandran
TJ Swasthika Jain
BN Chaithanya
Renuka Patil
author_facet A. Usha Ruby
J George Chellin Chandran
TJ Swasthika Jain
BN Chaithanya
Renuka Patil
author_sort A. Usha Ruby
collection DOAJ
description Diabetes is a category of metabolic disease commonly known as a chronic illness. It causes the body to generate less insulin and raises blood sugar levels, leading to various issues and disrupting the functioning of organs, including the retinal, kidney and nerves. To prevent this, people with chronic illnesses require lifetime access to treatment. As a result, early diabetes detection is essential and might save many lives. Diagnosis of people at high risk of developing diabetes is utilized for preventing the disease in various aspects. This article presents a chronic illness prediction prototype based on a person's risk feature data to provide an early prediction for diabetes with Fuzzy Entropy random vectors that regulate the development of each tree in the Random Forest. The proposed prototype consists of data imputation, data sampling, feature selection, and various techniques to predict the disease, such as Fuzzy Entropy, Synthetic Minority Oversampling Technique (SMOTE), Convolutional Neural Network (CNN) with Stochastic Gradient Descent with Momentum (SGDM), Support Vector Machines (SVM), Classification and Regression Tree (CART), K-Nearest Neighbor (KNN), and Naïve Bayes (NB). This study uses the existing Pima Indian Diabetes (PID) dataset for diabetic disease prediction. The predictions' true/false positive/negative rate is investigated using the confusion matrix and the receiver operating characteristic area under the curve (ROCAUC). Findings on a PID dataset are compared with machine learning algorithms revealing that the proposed Random Forest Fuzzy Entropy (RFFE) is a valuable approach for diabetes prediction, with an accuracy of 98 percent.
first_indexed 2024-03-13T05:40:59Z
format Article
id doaj.art-bba26ebc4d0d41e5990d86f93533644f
institution Directory Open Access Journal
issn 2327-8994
language English
last_indexed 2024-03-13T05:40:59Z
publishDate 2023-05-01
publisher AIMS Press
record_format Article
series AIMS Public Health
spelling doaj.art-bba26ebc4d0d41e5990d86f93533644f2023-06-14T01:26:03ZengAIMS PressAIMS Public Health2327-89942023-05-0110242244210.3934/publichealth.2023030RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes MellitusA. Usha Ruby0J George Chellin Chandran 1TJ Swasthika Jain2BN Chaithanya3Renuka Patil41. School of Computing Science and Engineering Department, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, Sehore, Madhya Pradesh–466114, India1. School of Computing Science and Engineering Department, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, Sehore, Madhya Pradesh–466114, India2. Department of Computer Science and Engineering, GITAM School of Technology, Nagadenehalli, Doddaballapura, Karnataka–561203, India2. Department of Computer Science and Engineering, GITAM School of Technology, Nagadenehalli, Doddaballapura, Karnataka–561203, India2. Department of Computer Science and Engineering, GITAM School of Technology, Nagadenehalli, Doddaballapura, Karnataka–561203, IndiaDiabetes is a category of metabolic disease commonly known as a chronic illness. It causes the body to generate less insulin and raises blood sugar levels, leading to various issues and disrupting the functioning of organs, including the retinal, kidney and nerves. To prevent this, people with chronic illnesses require lifetime access to treatment. As a result, early diabetes detection is essential and might save many lives. Diagnosis of people at high risk of developing diabetes is utilized for preventing the disease in various aspects. This article presents a chronic illness prediction prototype based on a person's risk feature data to provide an early prediction for diabetes with Fuzzy Entropy random vectors that regulate the development of each tree in the Random Forest. The proposed prototype consists of data imputation, data sampling, feature selection, and various techniques to predict the disease, such as Fuzzy Entropy, Synthetic Minority Oversampling Technique (SMOTE), Convolutional Neural Network (CNN) with Stochastic Gradient Descent with Momentum (SGDM), Support Vector Machines (SVM), Classification and Regression Tree (CART), K-Nearest Neighbor (KNN), and Naïve Bayes (NB). This study uses the existing Pima Indian Diabetes (PID) dataset for diabetic disease prediction. The predictions' true/false positive/negative rate is investigated using the confusion matrix and the receiver operating characteristic area under the curve (ROCAUC). Findings on a PID dataset are compared with machine learning algorithms revealing that the proposed Random Forest Fuzzy Entropy (RFFE) is a valuable approach for diabetes prediction, with an accuracy of 98 percent.https://www.aimspress.com/article/doi/10.3934/publichealth.2023030?viewType=HTMLdiabetes diseasesfuzzy entropymachine learningsynthetic gradient descent technique
spellingShingle A. Usha Ruby
J George Chellin Chandran
TJ Swasthika Jain
BN Chaithanya
Renuka Patil
RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus
AIMS Public Health
diabetes diseases
fuzzy entropy
machine learning
synthetic gradient descent technique
title RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus
title_full RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus
title_fullStr RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus
title_full_unstemmed RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus
title_short RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus
title_sort rffe random forest fuzzy entropy for the classification of diabetes mellitus
topic diabetes diseases
fuzzy entropy
machine learning
synthetic gradient descent technique
url https://www.aimspress.com/article/doi/10.3934/publichealth.2023030?viewType=HTML
work_keys_str_mv AT ausharuby rfferandomforestfuzzyentropyfortheclassificationofdiabetesmellitus
AT jgeorgechellinchandran rfferandomforestfuzzyentropyfortheclassificationofdiabetesmellitus
AT tjswasthikajain rfferandomforestfuzzyentropyfortheclassificationofdiabetesmellitus
AT bnchaithanya rfferandomforestfuzzyentropyfortheclassificationofdiabetesmellitus
AT renukapatil rfferandomforestfuzzyentropyfortheclassificationofdiabetesmellitus