An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data
This article proposes a new AdaBoost method with a k′k-means Bayes classifier for imbalanced data...
Main Authors: | Yanfeng Zhang, Lichun Wang |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG 2023-04-01 |
Series: | Mathematics |
Subjects: | imbalanced data, naive Bayes, imbalanced classifiers, AdaBoost method |
Online Access: | https://www.mdpi.com/2227-7390/11/8/1878 |
_version_ | 1797604435304644608 |
---|---|
author | Yanfeng Zhang Lichun Wang |
author_facet | Yanfeng Zhang Lichun Wang |
author_sort | Yanfeng Zhang |
collection | DOAJ |
description | This article proposes a new AdaBoost method with a k′k-means Bayes classifier for imbalanced data. It reduces the imbalance degree of the training data through the k′k-means Bayes method and then handles the imbalanced classification problem using multiple iterations with weight control, achieving good results without losing any raw data information or needing to generate additional data manually. The effectiveness of the proposed method is verified through numerical experiments that compare it with other traditional methods. In the NSL-KDD data experiment, the F-score of each minority class is also higher than that of the other methods. |
first_indexed | 2024-03-11T04:46:33Z |
format | Article |
id | doaj.art-19c1039e76074e26baa4a860e370e36d |
institution | Directory Open Access Journal |
issn | 2227-7390 |
language | English |
last_indexed | 2024-03-11T04:46:33Z |
publishDate | 2023-04-01 |
publisher | MDPI AG |
record_format | Article |
series | Mathematics |
spelling | doaj.art-19c1039e76074e26baa4a860e370e36d; 2023-11-17T20:17:52Z; eng; MDPI AG; Mathematics; 2227-7390; 2023-04-01; 11; 8; 1878; 10.3390/math11081878; An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data; Yanfeng Zhang (0); Lichun Wang (1); (0) Department of Statistics, Beijing Jiaotong University, Beijing 100044, China; (1) Department of Statistics, Beijing Jiaotong University, Beijing 100044, China; This article proposes a new AdaBoost method with a k′k-means Bayes classifier for imbalanced data. It reduces the imbalance degree of the training data through the k′k-means Bayes method and then handles the imbalanced classification problem using multiple iterations with weight control, achieving good results without losing any raw data information or needing to generate additional data manually. The effectiveness of the proposed method is verified through numerical experiments that compare it with other traditional methods. In the NSL-KDD data experiment, the F-score of each minority class is also higher than that of the other methods.; https://www.mdpi.com/2227-7390/11/8/1878; imbalanced data; naive Bayes; imbalanced classifiers; AdaBoost method |
spellingShingle | Yanfeng Zhang Lichun Wang An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data Mathematics imbalanced data naive Bayes imbalanced classifiers AdaBoost method |
title | An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data |
title_full | An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data |
title_fullStr | An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data |
title_full_unstemmed | An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data |
title_short | An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data |
title_sort | adaboost method with k k means bayes classifier for imbalanced data |
topic | imbalanced data naive Bayes imbalanced classifiers AdaBoost method |
url | https://www.mdpi.com/2227-7390/11/8/1878 |
work_keys_str_mv | AT yanfengzhang anadaboostmethodwithkkmeansbayesclassifierforimbalanceddata AT lichunwang anadaboostmethodwithkkmeansbayesclassifierforimbalanceddata AT yanfengzhang adaboostmethodwithkkmeansbayesclassifierforimbalanceddata AT lichunwang adaboostmethodwithkkmeansbayesclassifierforimbalanceddata |