Feature selection using multivariate adaptive regression splines in telecommunication fraud detection

Bibliographic Details
Main Authors: Mohamed Amin, M., Zainal, A., Mohd. Azmi, N. F., Ali, N. A.
Format: Conference or Workshop Item
Language: English
Published: 2020
Online Access: http://eprints.utm.my/93090/1/MuhalimMohamedAmin2020_FeatureSelectionUsingMultivariateAdaptiveRegression.pdf
Description
Summary: Feature selection identifies the most significant features for a given task and discards the noisy, irrelevant and redundant features of the dataset that might mislead the classifier. It also reduces the dimensionality of the dataset, which reduces computation time and improves prediction performance. This paper uses Multivariate Adaptive Regression Splines (MARS) in a Spline Model (SM) classifier to select an optimal feature subset for more accurate classification. Prediction performance was compared against Decision Tree (DT), Neural Network (NN) and Support Vector Machine (SVM) classifiers given the same optimal feature subset produced by MARS. The MARS technique reduced the number of features by up to 87.76% and improved classification accuracy. In the comparative analysis, the Spline classifier performed best, achieving the highest accuracy (97.44%) among the classifiers tested.
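
The summary does not say how the MARS feature subset is read off the fitted model. As a minimal sketch, assuming the standard MARS form (an intercept plus weighted hinge functions max(0, x - knot) and max(0, knot - x)), the selected features can be taken to be those that appear in at least one retained term. The names HingeTerm, predict, selected_features and reduction_pct, and all numeric values, are hypothetical and are not taken from the paper.

```python
from typing import List, NamedTuple, Sequence, Set


class HingeTerm(NamedTuple):
    """One retained MARS basis term (hypothetical container for illustration)."""
    feature: int    # index of the input feature the term depends on
    knot: float     # location of the hinge
    direction: int  # +1 for max(0, x - knot), -1 for max(0, knot - x)
    coef: float     # weight of the term in the model


def hinge(x: float, knot: float, direction: int) -> float:
    """Hinge basis function: zero on one side of the knot, linear on the other."""
    return max(0.0, direction * (x - knot))


def predict(intercept: float, terms: List[HingeTerm], row: Sequence[float]) -> float:
    """Evaluate an additive MARS-style model on a single record."""
    return intercept + sum(
        t.coef * hinge(row[t.feature], t.knot, t.direction) for t in terms
    )


def selected_features(terms: List[HingeTerm]) -> Set[int]:
    """Features used by at least one retained term; all others are dropped."""
    return {t.feature for t in terms}


def reduction_pct(n_original: int, n_selected: int) -> float:
    """Percentage of the original features that were removed."""
    return 100.0 * (n_original - n_selected) / n_original


# Made-up model over 4 features; only features 0 and 3 appear in a term.
terms = [HingeTerm(0, 2.0, +1, 0.5), HingeTerm(3, 1.0, -1, -1.2)]
row = [3.0, 9.9, 9.9, 0.5]

score = predict(0.1, terms, row)
kept = selected_features(terms)
pct = reduction_pct(len(row), len(kept))
```

In this toy model two of the four features never appear in a term, so they are dropped and the reduction is 50%. The reduction figure reported in the summary would be computed the same way from the paper's own feature counts, which this record does not state.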