Detecting Frauds and Payment Defaults on Credit Card Data Inherited With Imbalanced Class Distribution and Overlapping Class Problems: A Systematic Review

Credit card payments are one popular e-payment option apart from cash payments. Recent reports show that credit card fraud and payment defaults are increasing annually and are alarming. Thus, researchers have attempted various machine learning techniques to address these two challenges. However, the...

Full description

Bibliographic Details
Main Authors: Suraya Nurain Kalid, Kok-Chin Khor, Keng-Hoong Ng, Gee-Kok Tong
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10423008/
_version_ 1827347030828122112
author Suraya Nurain Kalid
Kok-Chin Khor
Keng-Hoong Ng
Gee-Kok Tong
author_facet Suraya Nurain Kalid
Kok-Chin Khor
Keng-Hoong Ng
Gee-Kok Tong
author_sort Suraya Nurain Kalid
collection DOAJ
description Credit card payments are one popular e-payment option apart from cash payments. Recent reports show that credit card fraud and payment defaults are increasing annually and are alarming. Thus, researchers have attempted various machine learning techniques to address these two challenges. However, they are challenged to mitigate the two major problems inherited in credit card data: (i) imbalanced class distribution and (ii) overlapping classes. Mitigating these problems shall effectively detect credit card frauds and payment defaults, thus benefiting card issuers and holders. Hence, this paper aims to develop a systematic review using PRISMA to identify and compare various credit card datasets, machine learning techniques, and evaluation metrics. Subsequently, we provide recommendations for handling these two problems. We extracted research papers from 2016 to 2023 from ScienceDirect, Springer, Association and Computing Machinery (ACM), and IEEE databases. The papers shall be included if written in English and published in peer-reviewed and indexed journals or conference proceedings. Finally, 87 papers were selected based on the eligibility criteria. Based on our findings, the European and Taiwan datasets are widely used in the research community. However, most researchers focus on tackling imbalanced class distribution rather than two problems together. We recommended to the research community the application of deep learning, ensemble learning, and sampling methods to effectively detect fraud and payment defaults on credit card datasets that inherit the two problems. In evaluating the machine learning algorithms, we recommend using metrics that can separately evaluate the algorithms’ performance in detecting frauds/payment defaults and normal transactions.
first_indexed 2024-03-07T23:41:32Z
format Article
id doaj.art-9151ec217850430b974e1cc54d7ff5f1
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-07T23:41:32Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-9151ec217850430b974e1cc54d7ff5f12024-02-20T00:01:10ZengIEEEIEEE Access2169-35362024-01-0112236362365210.1109/ACCESS.2024.336283110423008Detecting Frauds and Payment Defaults on Credit Card Data Inherited With Imbalanced Class Distribution and Overlapping Class Problems: A Systematic ReviewSuraya Nurain Kalid0https://orcid.org/0000-0003-4514-377XKok-Chin Khor1https://orcid.org/0000-0001-9346-1479Keng-Hoong Ng2https://orcid.org/0000-0001-8617-1086Gee-Kok Tong3https://orcid.org/0000-0002-5086-9383Faculty of Computing and Informatics, Multimedia University, Cyberjaya, Selangor, MalaysiaLee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman, Bandar Sungai Long, Kajang, MalaysiaLee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman, Bandar Sungai Long, Kajang, MalaysiaFaculty of Computing and Informatics, Multimedia University, Cyberjaya, Selangor, MalaysiaCredit card payments are one popular e-payment option apart from cash payments. Recent reports show that credit card fraud and payment defaults are increasing annually and are alarming. Thus, researchers have attempted various machine learning techniques to address these two challenges. However, they are challenged to mitigate the two major problems inherited in credit card data: (i) imbalanced class distribution and (ii) overlapping classes. Mitigating these problems shall effectively detect credit card frauds and payment defaults, thus benefiting card issuers and holders. Hence, this paper aims to develop a systematic review using PRISMA to identify and compare various credit card datasets, machine learning techniques, and evaluation metrics. Subsequently, we provide recommendations for handling these two problems. We extracted research papers from 2016 to 2023 from ScienceDirect, Springer, Association and Computing Machinery (ACM), and IEEE databases. The papers shall be included if written in English and published in peer-reviewed and indexed journals or conference proceedings. Finally, 87 papers were selected based on the eligibility criteria. Based on our findings, the European and Taiwan datasets are widely used in the research community. However, most researchers focus on tackling imbalanced class distribution rather than two problems together. We recommended to the research community the application of deep learning, ensemble learning, and sampling methods to effectively detect fraud and payment defaults on credit card datasets that inherit the two problems. In evaluating the machine learning algorithms, we recommend using metrics that can separately evaluate the algorithms’ performance in detecting frauds/payment defaults and normal transactions.https://ieeexplore.ieee.org/document/10423008/PRISMAcredit card fraudpayment defaultimbalanced class distributionoverlapping classes
spellingShingle Suraya Nurain Kalid
Kok-Chin Khor
Keng-Hoong Ng
Gee-Kok Tong
Detecting Frauds and Payment Defaults on Credit Card Data Inherited With Imbalanced Class Distribution and Overlapping Class Problems: A Systematic Review
IEEE Access
PRISMA
credit card fraud
payment default
imbalanced class distribution
overlapping classes
title Detecting Frauds and Payment Defaults on Credit Card Data Inherited With Imbalanced Class Distribution and Overlapping Class Problems: A Systematic Review
title_full Detecting Frauds and Payment Defaults on Credit Card Data Inherited With Imbalanced Class Distribution and Overlapping Class Problems: A Systematic Review
title_fullStr Detecting Frauds and Payment Defaults on Credit Card Data Inherited With Imbalanced Class Distribution and Overlapping Class Problems: A Systematic Review
title_full_unstemmed Detecting Frauds and Payment Defaults on Credit Card Data Inherited With Imbalanced Class Distribution and Overlapping Class Problems: A Systematic Review
title_short Detecting Frauds and Payment Defaults on Credit Card Data Inherited With Imbalanced Class Distribution and Overlapping Class Problems: A Systematic Review
title_sort detecting frauds and payment defaults on credit card data inherited with imbalanced class distribution and overlapping class problems a systematic review
topic PRISMA
credit card fraud
payment default
imbalanced class distribution
overlapping classes
url https://ieeexplore.ieee.org/document/10423008/
work_keys_str_mv AT surayanurainkalid detectingfraudsandpaymentdefaultsoncreditcarddatainheritedwithimbalancedclassdistributionandoverlappingclassproblemsasystematicreview
AT kokchinkhor detectingfraudsandpaymentdefaultsoncreditcarddatainheritedwithimbalancedclassdistributionandoverlappingclassproblemsasystematicreview
AT kenghoongng detectingfraudsandpaymentdefaultsoncreditcarddatainheritedwithimbalancedclassdistributionandoverlappingclassproblemsasystematicreview
AT geekoktong detectingfraudsandpaymentdefaultsoncreditcarddatainheritedwithimbalancedclassdistributionandoverlappingclassproblemsasystematicreview