A novel feature engineered-CatBoost-based supervised machine learning framework for electricity theft detection

This paper presents a novel supervised machine learning-based electric theft detection approach using the feature engineered-CatBoost algorithm in conjunction with the SMOTETomek algorithm. Contrary to the previous literature, where the missing observations in data are either ignored or imputed with...

Full description

Bibliographic Details
Main Authors: Hussain, Saddam, Mustafa, Mohd. Wazir, A. Jumani, Touqeer, Baloch, Shadi Khan, Alotaibi, Hammad, Khan, Ilyas, Khan, Afrasyab
Format: Article
Language:English
Published: Elsevier Ltd 2021
Subjects:
Online Access:http://eprints.utm.my/95358/1/SaddamHussain2021_ANovelFeatureEngineeredCatBoost.pdf
Description
Summary:This paper presents a novel supervised machine learning-based electric theft detection approach using the feature engineered-CatBoost algorithm in conjunction with the SMOTETomek algorithm. Contrary to the previous literature, where the missing observations in data are either ignored or imputed with average values, this work utilizes k-Nearest neighbor technique for missing data imputation; thus, an accurate and realistic estimation of the missing data is achieved. To mitigate the biasness to the majority data class, the proposed model utilizes the SMOTETomek algorithm, which neutralizes the mentioned effect by managing a proper balance between over-sampling and under-sampling techniques.