An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data

This article proposes a new AdaBoost method with <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mrow><mi mathvariant="normal">k</mi></mrow><mo>′</mo></msu...

Full description

Bibliographic Details
Main Authors: Yanfeng Zhang, Lichun Wang
Format: Article
Language:English
Published: MDPI AG 2023-04-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/11/8/1878
_version_ 1797604435304644608
author Yanfeng Zhang
Lichun Wang
author_facet Yanfeng Zhang
Lichun Wang
author_sort Yanfeng Zhang
collection DOAJ
description This article proposes a new AdaBoost method with <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mrow><mi mathvariant="normal">k</mi></mrow><mo>′</mo></msup></semantics></math></inline-formula>k-means Bayes classifier for imbalanced data. It reduces the imbalance degree of training data through the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mrow><mi mathvariant="normal">k</mi></mrow><mo>′</mo></msup></semantics></math></inline-formula>k-means Bayes method and then deals with the imbalanced classification problem using multiple iterations with weight control, achieving a good effect without losing any raw data information or needing to generate more relevant data manually. The effectiveness of the proposed method is verified by comparing it with other traditional methods based on numerical experiments. In the NSL-KDD data experiment, the F-score values of each minority class are also greater than the other methods.
first_indexed 2024-03-11T04:46:33Z
format Article
id doaj.art-19c1039e76074e26baa4a860e370e36d
institution Directory Open Access Journal
issn 2227-7390
language English
last_indexed 2024-03-11T04:46:33Z
publishDate 2023-04-01
publisher MDPI AG
record_format Article
series Mathematics
spelling doaj.art-19c1039e76074e26baa4a860e370e36d2023-11-17T20:17:52ZengMDPI AGMathematics2227-73902023-04-01118187810.3390/math11081878An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced DataYanfeng Zhang0Lichun Wang1Department of Statistics, Beijing Jiaotong University, Beijing 100044, ChinaDepartment of Statistics, Beijing Jiaotong University, Beijing 100044, ChinaThis article proposes a new AdaBoost method with <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mrow><mi mathvariant="normal">k</mi></mrow><mo>′</mo></msup></semantics></math></inline-formula>k-means Bayes classifier for imbalanced data. It reduces the imbalance degree of training data through the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mrow><mi mathvariant="normal">k</mi></mrow><mo>′</mo></msup></semantics></math></inline-formula>k-means Bayes method and then deals with the imbalanced classification problem using multiple iterations with weight control, achieving a good effect without losing any raw data information or needing to generate more relevant data manually. The effectiveness of the proposed method is verified by comparing it with other traditional methods based on numerical experiments. In the NSL-KDD data experiment, the F-score values of each minority class are also greater than the other methods.https://www.mdpi.com/2227-7390/11/8/1878imbalanced datanaive Bayesimbalanced classifiersAdaBoost method
spellingShingle Yanfeng Zhang
Lichun Wang
An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data
Mathematics
imbalanced data
naive Bayes
imbalanced classifiers
AdaBoost method
title An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data
title_full An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data
title_fullStr An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data
title_full_unstemmed An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data
title_short An AdaBoost Method with K′K-Means Bayes Classifier for Imbalanced Data
title_sort adaboost method with k k means bayes classifier for imbalanced data
topic imbalanced data
naive Bayes
imbalanced classifiers
AdaBoost method
url https://www.mdpi.com/2227-7390/11/8/1878
work_keys_str_mv AT yanfengzhang anadaboostmethodwithkkmeansbayesclassifierforimbalanceddata
AT lichunwang anadaboostmethodwithkkmeansbayesclassifierforimbalanceddata
AT yanfengzhang adaboostmethodwithkkmeansbayesclassifierforimbalanceddata
AT lichunwang adaboostmethodwithkkmeansbayesclassifierforimbalanceddata