Robust network anomaly detection using ensemble learning approach and explainable artificial intelligence (XAI)

Intrusion Detection Systems, specifically Network Anomaly Detection Systems (NADSs) are vital tools in network security. The NADSs are affected by data imbalance issues in classifying minority classes. Also, designing an efficient detection framework is sought after to achieve a higher detection rat...

Full description

Bibliographic Details
Main Authors: Mohammad Kazim Hooshmand, Manjaiah Doddaghatta Huchaiah, Ahmad Reda Alzighaibi, Hasan Hashim, El-Sayed Atlam, Ibrahim Gad
Format: Article
Language:English
Published: Elsevier 2024-05-01
Series:Alexandria Engineering Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1110016824002850
_version_ 1797203070925406208
author Mohammad Kazim Hooshmand
Manjaiah Doddaghatta Huchaiah
Ahmad Reda Alzighaibi
Hasan Hashim
El-Sayed Atlam
Ibrahim Gad
author_facet Mohammad Kazim Hooshmand
Manjaiah Doddaghatta Huchaiah
Ahmad Reda Alzighaibi
Hasan Hashim
El-Sayed Atlam
Ibrahim Gad
author_sort Mohammad Kazim Hooshmand
collection DOAJ
description Intrusion Detection Systems, specifically Network Anomaly Detection Systems (NADSs) are vital tools in network security. The NADSs are affected by data imbalance issues in classifying minority classes. Also, designing an efficient detection framework is sought after to achieve a higher detection rate for minority classes, especially when utilizing ensemble learning methods. To solve the issue of imbalanced data, a hybrid method of sampling techniques is proposed. This imbalance processing tool integrates the Synthetic Minority Oversampling Technique (SMOTE) and the K-means clustering algorithm (SKM). SMOTE over-samples the minority class, and K-means is used to perform a cluster-based under-sampling. We use Denoising Autoencoder (DAE) to select the top 15 features to reduce data dimensionality based on their higher weights. For anomaly detection, the XGBoost algorithm is deployed and the SHapley Additive exPlanation (SHAP) approach is deployed to provide explanations of the proposed techniques. The performance of the SKM-XGB model is assessed using the NSL-KDD and UNSW-NB15 datasets. A comparative analysis and series of experiments were carried out using several ensemble models with multiple base classifiers. The experimental findings indicate that the model's detection rate for binary classification and multiclass classification using the UNSW-NB15 dataset is 99.01% and 97.49%, respectively. The model achieves a 99.37% detection rate for binary classification and a 99.22% detection rate for multiclass classification on the NSL-KDD dataset. We conducted a comparative analysis of various ensemble models with multiple base classifiers. The results indicate that SKM-XGB outperforms the other investigated models and outperforms the performance of state-of-the-art models.
first_indexed 2024-04-24T08:13:30Z
format Article
id doaj.art-06154773b1ae4c3eb694a004117a9346
institution Directory Open Access Journal
issn 1110-0168
language English
last_indexed 2024-04-24T08:13:30Z
publishDate 2024-05-01
publisher Elsevier
record_format Article
series Alexandria Engineering Journal
spelling doaj.art-06154773b1ae4c3eb694a004117a93462024-04-17T04:48:40ZengElsevierAlexandria Engineering Journal1110-01682024-05-0194120130Robust network anomaly detection using ensemble learning approach and explainable artificial intelligence (XAI)Mohammad Kazim Hooshmand0Manjaiah Doddaghatta Huchaiah1Ahmad Reda Alzighaibi2Hasan Hashim3El-Sayed Atlam4Ibrahim Gad5Department of Computer Science, Kabul Education University, Kabul, Afghanistan; Department of Computer Science, Mangalore University, Mangalore, IndiaDepartment of Computer Science, Mangalore University, Mangalore, IndiaCollege of Computer Science and Engineering, Taibah University, Yanbu, Saudi ArabiaCollege of Computer Science and Engineering, Taibah University, Yanbu, Saudi ArabiaCollege of Computer Science and Engineering, Taibah University, Yanbu, Saudi Arabia; Department of Computer Science, Faculty of Science, Tanta University, Egypt; Corresponding author.Department of Computer Science, Faculty of Science, Tanta University, EgyptIntrusion Detection Systems, specifically Network Anomaly Detection Systems (NADSs) are vital tools in network security. The NADSs are affected by data imbalance issues in classifying minority classes. Also, designing an efficient detection framework is sought after to achieve a higher detection rate for minority classes, especially when utilizing ensemble learning methods. To solve the issue of imbalanced data, a hybrid method of sampling techniques is proposed. This imbalance processing tool integrates the Synthetic Minority Oversampling Technique (SMOTE) and the K-means clustering algorithm (SKM). SMOTE over-samples the minority class, and K-means is used to perform a cluster-based under-sampling. We use Denoising Autoencoder (DAE) to select the top 15 features to reduce data dimensionality based on their higher weights. For anomaly detection, the XGBoost algorithm is deployed and the SHapley Additive exPlanation (SHAP) approach is deployed to provide explanations of the proposed techniques. The performance of the SKM-XGB model is assessed using the NSL-KDD and UNSW-NB15 datasets. A comparative analysis and series of experiments were carried out using several ensemble models with multiple base classifiers. The experimental findings indicate that the model's detection rate for binary classification and multiclass classification using the UNSW-NB15 dataset is 99.01% and 97.49%, respectively. The model achieves a 99.37% detection rate for binary classification and a 99.22% detection rate for multiclass classification on the NSL-KDD dataset. We conducted a comparative analysis of various ensemble models with multiple base classifiers. The results indicate that SKM-XGB outperforms the other investigated models and outperforms the performance of state-of-the-art models.http://www.sciencedirect.com/science/article/pii/S1110016824002850IDSNetwork anomaly detection systemsEnsemble learningXGBoostSMOTEOversampling
spellingShingle Mohammad Kazim Hooshmand
Manjaiah Doddaghatta Huchaiah
Ahmad Reda Alzighaibi
Hasan Hashim
El-Sayed Atlam
Ibrahim Gad
Robust network anomaly detection using ensemble learning approach and explainable artificial intelligence (XAI)
Alexandria Engineering Journal
IDS
Network anomaly detection systems
Ensemble learning
XGBoost
SMOTE
Oversampling
title Robust network anomaly detection using ensemble learning approach and explainable artificial intelligence (XAI)
title_full Robust network anomaly detection using ensemble learning approach and explainable artificial intelligence (XAI)
title_fullStr Robust network anomaly detection using ensemble learning approach and explainable artificial intelligence (XAI)
title_full_unstemmed Robust network anomaly detection using ensemble learning approach and explainable artificial intelligence (XAI)
title_short Robust network anomaly detection using ensemble learning approach and explainable artificial intelligence (XAI)
title_sort robust network anomaly detection using ensemble learning approach and explainable artificial intelligence xai
topic IDS
Network anomaly detection systems
Ensemble learning
XGBoost
SMOTE
Oversampling
url http://www.sciencedirect.com/science/article/pii/S1110016824002850
work_keys_str_mv AT mohammadkazimhooshmand robustnetworkanomalydetectionusingensemblelearningapproachandexplainableartificialintelligencexai
AT manjaiahdoddaghattahuchaiah robustnetworkanomalydetectionusingensemblelearningapproachandexplainableartificialintelligencexai
AT ahmadredaalzighaibi robustnetworkanomalydetectionusingensemblelearningapproachandexplainableartificialintelligencexai
AT hasanhashim robustnetworkanomalydetectionusingensemblelearningapproachandexplainableartificialintelligencexai
AT elsayedatlam robustnetworkanomalydetectionusingensemblelearningapproachandexplainableartificialintelligencexai
AT ibrahimgad robustnetworkanomalydetectionusingensemblelearningapproachandexplainableartificialintelligencexai