RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus
Diabetes is a category of metabolic disease commonly known as a chronic illness. It causes the body to generate less insulin and raises blood sugar levels, leading to various issues and disrupting the functioning of organs, including the retinal, kidney and nerves. To prevent this, people with chron...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
AIMS Press
2023-05-01
|
Series: | AIMS Public Health |
Subjects: | |
Online Access: | https://www.aimspress.com/article/doi/10.3934/publichealth.2023030?viewType=HTML |
_version_ | 1797804689464492032 |
---|---|
author | A. Usha Ruby J George Chellin Chandran TJ Swasthika Jain BN Chaithanya Renuka Patil |
author_facet | A. Usha Ruby J George Chellin Chandran TJ Swasthika Jain BN Chaithanya Renuka Patil |
author_sort | A. Usha Ruby |
collection | DOAJ |
description | Diabetes is a category of metabolic disease commonly known as a chronic illness. It causes the body to generate less insulin and raises blood sugar levels, leading to various issues and disrupting the functioning of organs, including the retinal, kidney and nerves. To prevent this, people with chronic illnesses require lifetime access to treatment. As a result, early diabetes detection is essential and might save many lives. Diagnosis of people at high risk of developing diabetes is utilized for preventing the disease in various aspects. This article presents a chronic illness prediction prototype based on a person's risk feature data to provide an early prediction for diabetes with Fuzzy Entropy random vectors that regulate the development of each tree in the Random Forest. The proposed prototype consists of data imputation, data sampling, feature selection, and various techniques to predict the disease, such as Fuzzy Entropy, Synthetic Minority Oversampling Technique (SMOTE), Convolutional Neural Network (CNN) with Stochastic Gradient Descent with Momentum (SGDM), Support Vector Machines (SVM), Classification and Regression Tree (CART), K-Nearest Neighbor (KNN), and Naïve Bayes (NB). This study uses the existing Pima Indian Diabetes (PID) dataset for diabetic disease prediction. The predictions' true/false positive/negative rate is investigated using the confusion matrix and the receiver operating characteristic area under the curve (ROCAUC). Findings on a PID dataset are compared with machine learning algorithms revealing that the proposed Random Forest Fuzzy Entropy (RFFE) is a valuable approach for diabetes prediction, with an accuracy of 98 percent. |
first_indexed | 2024-03-13T05:40:59Z |
format | Article |
id | doaj.art-bba26ebc4d0d41e5990d86f93533644f |
institution | Directory Open Access Journal |
issn | 2327-8994 |
language | English |
last_indexed | 2024-03-13T05:40:59Z |
publishDate | 2023-05-01 |
publisher | AIMS Press |
record_format | Article |
series | AIMS Public Health |
spelling | doaj.art-bba26ebc4d0d41e5990d86f93533644f2023-06-14T01:26:03ZengAIMS PressAIMS Public Health2327-89942023-05-0110242244210.3934/publichealth.2023030RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes MellitusA. Usha Ruby0J George Chellin Chandran 1TJ Swasthika Jain2BN Chaithanya3Renuka Patil41. School of Computing Science and Engineering Department, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, Sehore, Madhya Pradesh–466114, India1. School of Computing Science and Engineering Department, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, Sehore, Madhya Pradesh–466114, India2. Department of Computer Science and Engineering, GITAM School of Technology, Nagadenehalli, Doddaballapura, Karnataka–561203, India2. Department of Computer Science and Engineering, GITAM School of Technology, Nagadenehalli, Doddaballapura, Karnataka–561203, India2. Department of Computer Science and Engineering, GITAM School of Technology, Nagadenehalli, Doddaballapura, Karnataka–561203, IndiaDiabetes is a category of metabolic disease commonly known as a chronic illness. It causes the body to generate less insulin and raises blood sugar levels, leading to various issues and disrupting the functioning of organs, including the retinal, kidney and nerves. To prevent this, people with chronic illnesses require lifetime access to treatment. As a result, early diabetes detection is essential and might save many lives. Diagnosis of people at high risk of developing diabetes is utilized for preventing the disease in various aspects. This article presents a chronic illness prediction prototype based on a person's risk feature data to provide an early prediction for diabetes with Fuzzy Entropy random vectors that regulate the development of each tree in the Random Forest. The proposed prototype consists of data imputation, data sampling, feature selection, and various techniques to predict the disease, such as Fuzzy Entropy, Synthetic Minority Oversampling Technique (SMOTE), Convolutional Neural Network (CNN) with Stochastic Gradient Descent with Momentum (SGDM), Support Vector Machines (SVM), Classification and Regression Tree (CART), K-Nearest Neighbor (KNN), and Naïve Bayes (NB). This study uses the existing Pima Indian Diabetes (PID) dataset for diabetic disease prediction. The predictions' true/false positive/negative rate is investigated using the confusion matrix and the receiver operating characteristic area under the curve (ROCAUC). Findings on a PID dataset are compared with machine learning algorithms revealing that the proposed Random Forest Fuzzy Entropy (RFFE) is a valuable approach for diabetes prediction, with an accuracy of 98 percent.https://www.aimspress.com/article/doi/10.3934/publichealth.2023030?viewType=HTMLdiabetes diseasesfuzzy entropymachine learningsynthetic gradient descent technique |
spellingShingle | A. Usha Ruby J George Chellin Chandran TJ Swasthika Jain BN Chaithanya Renuka Patil RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus AIMS Public Health diabetes diseases fuzzy entropy machine learning synthetic gradient descent technique |
title | RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus |
title_full | RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus |
title_fullStr | RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus |
title_full_unstemmed | RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus |
title_short | RFFE – Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus |
title_sort | rffe random forest fuzzy entropy for the classification of diabetes mellitus |
topic | diabetes diseases fuzzy entropy machine learning synthetic gradient descent technique |
url | https://www.aimspress.com/article/doi/10.3934/publichealth.2023030?viewType=HTML |
work_keys_str_mv | AT ausharuby rfferandomforestfuzzyentropyfortheclassificationofdiabetesmellitus AT jgeorgechellinchandran rfferandomforestfuzzyentropyfortheclassificationofdiabetesmellitus AT tjswasthikajain rfferandomforestfuzzyentropyfortheclassificationofdiabetesmellitus AT bnchaithanya rfferandomforestfuzzyentropyfortheclassificationofdiabetesmellitus AT renukapatil rfferandomforestfuzzyentropyfortheclassificationofdiabetesmellitus |