Robust pridit scoring method for classification fraud cases in financial data

Thesis (PhD. (Mathematics))

Bibliographic Details
Main Author:	Tukiman, Norbaiti
Format:	Thesis
Language:	English
Published:	Universiti Teknologi Malaysia 2023
Subjects:	Business enterprises > Finance; Robust optimization; Fraud
Online Access:	http://openscience.utm.my/handle/123456789/427

_version_	1796848861797416960
author	Tukiman, Norbaiti
author_facet	Tukiman, Norbaiti
author_sort	Tukiman, Norbaiti
collection	OpenScience
description	Thesis (PhD. (Mathematics))
first_indexed	2024-03-04T10:32:54Z
format	Thesis
id	oai:openscience.utm.my:123456789/427
institution	Universiti Teknologi Malaysia - OpenScience
language	English
last_indexed	2024-03-04T10:32:54Z
publishDate	2023
publisher	Universiti Teknologi Malaysia
record_format	dspace
spelling	oai:openscience.utm.my:123456789/427 2023-09-24T14:22:20Z Robust pridit scoring method for classification fraud cases in financial data Tukiman, Norbaiti Business enterprises--Finance Robust optimization Fraud Thesis (PhD. (Mathematics)) An increasing number of fraud cases could jeopardize business solvency. Identifying fraud with effective statistical methods, such as classification, can protect organisations from this pitfall. However, identifying fraud cases can be a statistical challenge because of several characteristics of financial datasets. These datasets are typically large and high-dimensional, contain mixed data types and can involve an imbalanced number of fraud and non-fraud cases. This study employed Principal Component Analysis (PCA) based on Relative to an Identified Distribution (RIDIT) scores, known as the PRIDIT method, to classify and identify data that could potentially be fraudulent cases. The classical PRIDIT method transforms each analysed dataset onto a probability scale, the RIDIT score. PCA is then applied to the RIDIT score data matrix to capture the highest variability in the dataset. However, the classical PRIDIT framework has a limitation: the Pearson correlation measures on which its PCA is based are insensitive to the variability of the data. In addition, there are no specific measurements for assessing the PRIDIT method's performance under different data characteristics. Hence, this study proposed a robust PRIDIT methodology framework that incorporates several robust estimators (M-Huber, M-Tukey Bisquare, MM and LTS estimators) to improve classification performance in identifying potentially fraudulent cases. The proposed method was applied to a German Credit Card Dataset. 
The analysis indicates that the highest accuracy rate, 48.5%, was obtained by robust PRIDIT based on the M-Tukey Bisquare estimator, followed by robust PRIDIT based on the MM and LTS estimators, which, like classical PRIDIT, reached an accuracy of 48.1%. The lowest accuracy, 47.9%, was obtained by robust PRIDIT based on the M-Huber estimator. A simulation study was also conducted to assess the performance of the different PRIDIT methods. Their behaviour was observed under different credibility percentage settings (Non-Fraud (NF);Fraud (F) cases: 95%NF;5%F, 90%NF;10%F, 80%NF;20%F and 70%NF;30%F) and variability levels (low, medium and high) in the datasets. The simulation results show that the accuracy rates obtained by classical PRIDIT and by robust PRIDIT based on the M-Tukey Bisquare, MM, LTS and M-Huber estimators are 64.3%, 65.3%, 65%, 63.7% and 61.7% respectively at the credibility setting 70%NF;30%F with medium variability. Thus, the findings indicate that robust PRIDIT based on M-Tukey Bisquare outperforms the other estimators, achieving the highest accuracy rate of 65.3%. In addition, the robust PRIDIT method has a better accuracy rate than the classical PRIDIT method when data variability is medium or high. Thus, this study has introduced a new method using robust PRIDIT to assess the credibility of financial data effectively. Faculty of Science 2023-07-16T07:45:05Z 2023-07-16T07:45:05Z 2022 Thesis Dataset http://openscience.utm.my/handle/123456789/427 en application/pdf application/pdf application/pdf Universiti Teknologi Malaysia
spellingShingle	Business enterprises--Finance Robust optimization Fraud Tukiman, Norbaiti Robust pridit scoring method for classification fraud cases in financial data
title	Robust pridit scoring method for classification fraud cases in financial data
title_full	Robust pridit scoring method for classification fraud cases in financial data
title_fullStr	Robust pridit scoring method for classification fraud cases in financial data
title_full_unstemmed	Robust pridit scoring method for classification fraud cases in financial data
title_short	Robust pridit scoring method for classification fraud cases in financial data
title_sort	robust pridit scoring method for classification fraud cases in financial data
topic	Business enterprises--Finance Robust optimization Fraud
url	http://openscience.utm.my/handle/123456789/427
work_keys_str_mv	AT tukimannorbaiti robustpriditscoringmethodforclassificationfraudcasesinfinancialdata
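
The abstract in this record describes classical PRIDIT as two steps: each variable is converted to RIDIT scores, and PCA is then applied to the RIDIT score matrix. The following is a minimal Python sketch of that classical pipeline only, not the thesis's implementation. It assumes ordinal integer-coded variables, the signed RIDIT form (proportion in lower categories minus proportion in higher categories), PCA on the Pearson correlation matrix, and no constant columns. The function names `ridit_scores` and `pridit`, the sign convention, and the toy data are illustrative assumptions. The robust estimators named in the abstract are not implemented here.

```python
import numpy as np


def ridit_scores(column):
    """Signed RIDIT score for one ordinal column (assumed form).

    For category i: (proportion in lower categories) minus
    (proportion in higher categories), giving values in [-1, 1].
    """
    column = np.asarray(column).ravel()
    _, inverse, counts = np.unique(
        column, return_inverse=True, return_counts=True
    )
    p = counts / counts.sum()
    cumulative = np.cumsum(p)
    below = cumulative - p      # mass strictly below each category
    above = 1.0 - cumulative    # mass strictly above each category
    return (below - above)[inverse]


def pridit(X):
    """Classical PRIDIT sketch: RIDIT transform, then first principal component.

    Returns the RIDIT matrix F, the first-component weights w, and one
    score per row. Assumes every column has at least two categories.
    """
    X = np.asarray(X)
    F = np.column_stack([ridit_scores(X[:, j]) for j in range(X.shape[1])])

    # PCA on the Pearson correlation matrix of the RIDIT scores.
    R = np.corrcoef(F, rowvar=False)
    eigenvalues, eigenvectors = np.linalg.eigh(R)  # ascending eigenvalues
    w = eigenvectors[:, -1]                        # largest eigenvalue

    # The sign of an eigenvector is arbitrary; this convention is an assumption.
    if w.sum() < 0:
        w = -w

    # RIDIT columns already have mean zero, so only scaling is needed.
    Z = F / F.std(axis=0)
    scores = Z @ w
    return F, w, scores


# Hypothetical toy data: 6 cases, 3 ordinal indicators coded 0..2.
X_toy = np.array([
    [0, 1, 2],
    [1, 1, 0],
    [2, 0, 1],
    [0, 2, 2],
    [1, 0, 0],
    [2, 2, 1],
])
F_toy, w_toy, scores_toy = pridit(X_toy)
print(scores_toy)
```

Which end of the score scale indicates suspected fraud depends on how the indicator categories are ordered, so any classification threshold on these scores has to be chosen with the coding of the data in mind.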

Similar Items