A Learning Objective Controllable Sphere-Based Method for Balanced and Imbalanced Data Classification

Imbalanced data classification is one of the most important tasks in the field of machine learning because abnormality, which is usually of our interest, appears less frequently than normality in real-world systems. Learning classifiers from imbalanced data can be troublesome due to no absolute stan...

Full description

Bibliographic Details
Main Authors: Yeontark Park, Jong-Seok Lee
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9625925/
_version_ 1818460816214589440
author Yeontark Park
Jong-Seok Lee
author_facet Yeontark Park
Jong-Seok Lee
author_sort Yeontark Park
collection DOAJ
description Imbalanced data classification is one of the most important tasks in the field of machine learning because abnormality, which is usually of our interest, appears less frequently than normality in real-world systems. Learning classifiers from imbalanced data can be troublesome due to no absolute standard as to how much imbalance can be said to be imbalanced or balanced. To address this issue, this research proposes a new sphere-based classification method named LOCS (learning objective controllable sphere-based classifier), which is designed to maximize AUC (area under ROC curve). The AUC learning objective was adopted from the fact that it approximates the accuracy as class distribution becomes balanced. Therefore, the proposed method properly performs a classification task for both imbalanced and balanced data. It constructs a classification model by a single training, whereas existing cost-sensitive learning and resampling methods usually attempt different parameter settings. In addition, the learning objective can be easily modified within LOCS for each of application domains by setting different importance levels for positive and negative classes, respectively. Numerical experiments based on 25 real datasets with several investigational settings showed the effectiveness and the intended strengths of the proposed method.
first_indexed 2024-12-14T23:36:15Z
format Article
id doaj.art-f94a5f2921364ab09353bf9e527dd376
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-14T23:36:15Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-f94a5f2921364ab09353bf9e527dd3762022-12-21T22:43:38ZengIEEEIEEE Access2169-35362021-01-01915801015802610.1109/ACCESS.2021.31302729625925A Learning Objective Controllable Sphere-Based Method for Balanced and Imbalanced Data ClassificationYeontark Park0Jong-Seok Lee1https://orcid.org/0000-0001-5255-4425Department of Industrial Engineering, Sungkyunkwan University, Suwon, Republic of KoreaDepartment of Industrial Engineering, Sungkyunkwan University, Suwon, Republic of KoreaImbalanced data classification is one of the most important tasks in the field of machine learning because abnormality, which is usually of our interest, appears less frequently than normality in real-world systems. Learning classifiers from imbalanced data can be troublesome due to no absolute standard as to how much imbalance can be said to be imbalanced or balanced. To address this issue, this research proposes a new sphere-based classification method named LOCS (learning objective controllable sphere-based classifier), which is designed to maximize AUC (area under ROC curve). The AUC learning objective was adopted from the fact that it approximates the accuracy as class distribution becomes balanced. Therefore, the proposed method properly performs a classification task for both imbalanced and balanced data. It constructs a classification model by a single training, whereas existing cost-sensitive learning and resampling methods usually attempt different parameter settings. In addition, the learning objective can be easily modified within LOCS for each of application domains by setting different importance levels for positive and negative classes, respectively. Numerical experiments based on 25 real datasets with several investigational settings showed the effectiveness and the intended strengths of the proposed method.https://ieeexplore.ieee.org/document/9625925/Classificationclass imbalancesphere coveringlearning objectivearea under ROC curve
spellingShingle Yeontark Park
Jong-Seok Lee
A Learning Objective Controllable Sphere-Based Method for Balanced and Imbalanced Data Classification
IEEE Access
Classification
class imbalance
sphere covering
learning objective
area under ROC curve
title A Learning Objective Controllable Sphere-Based Method for Balanced and Imbalanced Data Classification
title_full A Learning Objective Controllable Sphere-Based Method for Balanced and Imbalanced Data Classification
title_fullStr A Learning Objective Controllable Sphere-Based Method for Balanced and Imbalanced Data Classification
title_full_unstemmed A Learning Objective Controllable Sphere-Based Method for Balanced and Imbalanced Data Classification
title_short A Learning Objective Controllable Sphere-Based Method for Balanced and Imbalanced Data Classification
title_sort learning objective controllable sphere based method for balanced and imbalanced data classification
topic Classification
class imbalance
sphere covering
learning objective
area under ROC curve
url https://ieeexplore.ieee.org/document/9625925/
work_keys_str_mv AT yeontarkpark alearningobjectivecontrollablespherebasedmethodforbalancedandimbalanceddataclassification
AT jongseoklee alearningobjectivecontrollablespherebasedmethodforbalancedandimbalanceddataclassification
AT yeontarkpark learningobjectivecontrollablespherebasedmethodforbalancedandimbalanceddataclassification
AT jongseoklee learningobjectivecontrollablespherebasedmethodforbalancedandimbalanceddataclassification