Integrating Data Mining Techniques for Naïve Bayes Classification: Applications to Medical Datasets

In this study, we designed a framework in which three techniques—classification tree, association rules analysis (ASA), and the naïve Bayes classifier—were combined to improve the performance of the latter. A classification tree was used to discretize quantitative predictors into categories and ASA...

Full description

Bibliographic Details
Main Authors: Pannapa Changpetch, Apasiri Pitpeng, Sasiprapa Hiriote, Chumpol Yuangyai
Format: Article
Language:English
Published: MDPI AG 2021-09-01
Series:Computation
Subjects:
Online Access:https://www.mdpi.com/2079-3197/9/9/99
_version_ 1829452945498832896
author Pannapa Changpetch
Apasiri Pitpeng
Sasiprapa Hiriote
Chumpol Yuangyai
author_facet Pannapa Changpetch
Apasiri Pitpeng
Sasiprapa Hiriote
Chumpol Yuangyai
author_sort Pannapa Changpetch
collection DOAJ
description In this study, we designed a framework in which three techniques—classification tree, association rules analysis (ASA), and the naïve Bayes classifier—were combined to improve the performance of the latter. A classification tree was used to discretize quantitative predictors into categories and ASA was used to generate interactions in a fully realized way, as discretized variables and interactions are key to improving the classification accuracy of the naïve Bayes classifier. We applied our methodology to three medical datasets to demonstrate the efficacy of the proposed method. The results showed that our methodology outperformed the existing techniques for all the illustrated datasets. Although our focus here was on medical datasets, our proposed methodology is equally applicable to datasets in many other areas.
first_indexed 2024-03-10T07:47:51Z
format Article
id doaj.art-dd5d7d4349cd4ac6ad3f94b4d55d9398
institution Directory Open Access Journal
issn 2079-3197
language English
last_indexed 2024-03-10T07:47:51Z
publishDate 2021-09-01
publisher MDPI AG
record_format Article
series Computation
spelling doaj.art-dd5d7d4349cd4ac6ad3f94b4d55d93982023-11-22T12:33:19ZengMDPI AGComputation2079-31972021-09-01999910.3390/computation9090099Integrating Data Mining Techniques for Naïve Bayes Classification: Applications to Medical DatasetsPannapa Changpetch0Apasiri Pitpeng1Sasiprapa Hiriote2Chumpol Yuangyai3Department of Mathematics, Faculty of Science, Mahidol University, Bangkok 10400, ThailandDepartment of Mathematics, Faculty of Science, Mahidol University, Bangkok 10400, ThailandDepartment of Statistics, Faculty of Science, Silpakorn University, Nakhon Pathom 73000, ThailandDepartment of Industrial Engineering, School of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, ThailandIn this study, we designed a framework in which three techniques—classification tree, association rules analysis (ASA), and the naïve Bayes classifier—were combined to improve the performance of the latter. A classification tree was used to discretize quantitative predictors into categories and ASA was used to generate interactions in a fully realized way, as discretized variables and interactions are key to improving the classification accuracy of the naïve Bayes classifier. We applied our methodology to three medical datasets to demonstrate the efficacy of the proposed method. The results showed that our methodology outperformed the existing techniques for all the illustrated datasets. Although our focus here was on medical datasets, our proposed methodology is equally applicable to datasets in many other areas.https://www.mdpi.com/2079-3197/9/9/99association rules analysisclassification treediscretizationinteractionsnaïve Bayes
spellingShingle Pannapa Changpetch
Apasiri Pitpeng
Sasiprapa Hiriote
Chumpol Yuangyai
Integrating Data Mining Techniques for Naïve Bayes Classification: Applications to Medical Datasets
Computation
association rules analysis
classification tree
discretization
interactions
naïve Bayes
title Integrating Data Mining Techniques for Naïve Bayes Classification: Applications to Medical Datasets
title_full Integrating Data Mining Techniques for Naïve Bayes Classification: Applications to Medical Datasets
title_fullStr Integrating Data Mining Techniques for Naïve Bayes Classification: Applications to Medical Datasets
title_full_unstemmed Integrating Data Mining Techniques for Naïve Bayes Classification: Applications to Medical Datasets
title_short Integrating Data Mining Techniques for Naïve Bayes Classification: Applications to Medical Datasets
title_sort integrating data mining techniques for naive bayes classification applications to medical datasets
topic association rules analysis
classification tree
discretization
interactions
naïve Bayes
url https://www.mdpi.com/2079-3197/9/9/99
work_keys_str_mv AT pannapachangpetch integratingdataminingtechniquesfornaivebayesclassificationapplicationstomedicaldatasets
AT apasiripitpeng integratingdataminingtechniquesfornaivebayesclassificationapplicationstomedicaldatasets
AT sasiprapahiriote integratingdataminingtechniquesfornaivebayesclassificationapplicationstomedicaldatasets
AT chumpolyuangyai integratingdataminingtechniquesfornaivebayesclassificationapplicationstomedicaldatasets