Iterative cleaning and learning of big highly-imbalanced fraud data using unsupervised learning

Abstract Fraud datasets often times lack consistent and accurate labels, and are characterized by having high class imbalance where the number of fraudulent examples are far fewer than those of normal ones. Machine learning designed for effectively detecting fraud is an important task since fraudule...

Full description

Bibliographic Details
Main Authors: Robert K. L. Kennedy, Zahra Salekshahrezaee, Flavio Villanustre, Taghi M. Khoshgoftaar
Format: Article
Language:English
Published: SpringerOpen 2023-06-01
Series:Journal of Big Data
Subjects:
Online Access:https://doi.org/10.1186/s40537-023-00750-3