Improved cost-sensitive representation of data for solving the imbalanced big data classification problem
Abstract Dimension reduction is a preprocessing step in machine learning for eliminating undesirable features and increasing learning accuracy. In order to reduce the redundant features, there are data representation methods, each of which has its own advantages. On the other hand, big data with imb...
Main Authors: | Mahboubeh Fattahi, Mohammad Hossein Moattar, Yahya Forghani |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2022-05-01
|
Series: | Journal of Big Data |
Subjects: | |
Online Access: | https://doi.org/10.1186/s40537-022-00617-z |
Similar Items
-
Feature selected cost-sensitive twin SVM for imbalanced data
by: Li Xiaopeng, et al.
Published: (2020-01-01) -
Imbalanced data classification using MapReduce and relief
by: Joanna Jedrzejowicz, et al.
Published: (2018-04-01) -
A Hybrid Undersampling-SMOTE Method for Imbalanced Big Data Classification
by: Shaymaa Ahmed Razoqi, et al.
Published: (2023-12-01) -
SMOTE-BD: An Exact and Scalable Oversampling Method for Imbalanced Classification in Big Data
by: María José Basgall, et al.
Published: (2018-12-01) -
PSU: Particle Stacking Undersampling Method for Highly Imbalanced Big Data
by: Yong-Seok Jeon, et al.
Published: (2020-01-01)