Detecting representative data and generating synthetic samples to improve learning accuracy with imbalanced data sets.

Learning models struggle to achieve high classification performance on imbalanced data sets: when one of the classes is much larger than the others, most machine learning and data mining classifiers are overly influenced by the larger classes and igno...

Bibliographic Details
Main Authors: Der-Chiang Li, Susan C Hu, Liang-Sian Lin, Chun-Wu Yeh
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2017-01-01
Series: PLoS ONE
Online Access: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0181853&type=printable