Network intrusion detection using machine learning approaches: Addressing data imbalance

Abstract Cybersecurity has become a significant issue. Machine learning algorithms are known to help identify cyberattacks such as network intrusion. However, common network intrusion datasets are negatively affected by class imbalance: the normal traffic behaviour constitutes most of the dataset, w...

Full description

Bibliographic Details
Main Authors: Rahbar Ahsan, Wei Shi, Jean‐Pierre Corriveau
Format: Article
Language:English
Published: Wiley 2022-03-01
Series:IET Cyber-Physical Systems
Subjects:
Online Access:https://doi.org/10.1049/cps2.12013
_version_ 1818395252285767680
author Rahbar Ahsan
Wei Shi
Jean‐Pierre Corriveau
author_facet Rahbar Ahsan
Wei Shi
Jean‐Pierre Corriveau
author_sort Rahbar Ahsan
collection DOAJ
description Abstract Cybersecurity has become a significant issue. Machine learning algorithms are known to help identify cyberattacks such as network intrusion. However, common network intrusion datasets are negatively affected by class imbalance: the normal traffic behaviour constitutes most of the dataset, whereas intrusion traffic behaviour forms a significantly smaller portion. A comparative evaluation of the performance is conducted of several classical machine learning algorithms, as well as deep learning algorithms, on the well‐known National Security Lab Knowledge Discovery and Data Mining dataset for intrusion detection. More specifically, two variants of a fully connected neural network, one with an autoencoder and one without, have been implemented to compare their performance against seven classical machine learning algorithms. A voting classifier is also proposed to combine the decisions of these nine machine learning algorithms. All of the models are tested in combination with three different resampling techniques: oversampling, undersampling, and hybrid sampling. The details of the experiments conducted and an analysis of their results are then discussed.
first_indexed 2024-12-14T06:14:09Z
format Article
id doaj.art-30df1d85997e463ba81b07e83503792b
institution Directory Open Access Journal
issn 2398-3396
language English
last_indexed 2024-12-14T06:14:09Z
publishDate 2022-03-01
publisher Wiley
record_format Article
series IET Cyber-Physical Systems
spelling doaj.art-30df1d85997e463ba81b07e83503792b2022-12-21T23:14:05ZengWileyIET Cyber-Physical Systems2398-33962022-03-0171303910.1049/cps2.12013Network intrusion detection using machine learning approaches: Addressing data imbalanceRahbar Ahsan0Wei Shi1Jean‐Pierre Corriveau2School of Computer Science Carleton University Ottawa CanadaSchool of Information Technology Carleton University Ottawa CanadaSchool of Computer Science Carleton University Ottawa CanadaAbstract Cybersecurity has become a significant issue. Machine learning algorithms are known to help identify cyberattacks such as network intrusion. However, common network intrusion datasets are negatively affected by class imbalance: the normal traffic behaviour constitutes most of the dataset, whereas intrusion traffic behaviour forms a significantly smaller portion. A comparative evaluation of the performance is conducted of several classical machine learning algorithms, as well as deep learning algorithms, on the well‐known National Security Lab Knowledge Discovery and Data Mining dataset for intrusion detection. More specifically, two variants of a fully connected neural network, one with an autoencoder and one without, have been implemented to compare their performance against seven classical machine learning algorithms. A voting classifier is also proposed to combine the decisions of these nine machine learning algorithms. All of the models are tested in combination with three different resampling techniques: oversampling, undersampling, and hybrid sampling. The details of the experiments conducted and an analysis of their results are then discussed.https://doi.org/10.1049/cps2.12013security of datadata miningpattern classificationdeep learning (artificial intelligence)sampling methods
spellingShingle Rahbar Ahsan
Wei Shi
Jean‐Pierre Corriveau
Network intrusion detection using machine learning approaches: Addressing data imbalance
IET Cyber-Physical Systems
security of data
data mining
pattern classification
deep learning (artificial intelligence)
sampling methods
title Network intrusion detection using machine learning approaches: Addressing data imbalance
title_full Network intrusion detection using machine learning approaches: Addressing data imbalance
title_fullStr Network intrusion detection using machine learning approaches: Addressing data imbalance
title_full_unstemmed Network intrusion detection using machine learning approaches: Addressing data imbalance
title_short Network intrusion detection using machine learning approaches: Addressing data imbalance
title_sort network intrusion detection using machine learning approaches addressing data imbalance
topic security of data
data mining
pattern classification
deep learning (artificial intelligence)
sampling methods
url https://doi.org/10.1049/cps2.12013
work_keys_str_mv AT rahbarahsan networkintrusiondetectionusingmachinelearningapproachesaddressingdataimbalance
AT weishi networkintrusiondetectionusingmachinelearningapproachesaddressingdataimbalance
AT jeanpierrecorriveau networkintrusiondetectionusingmachinelearningapproachesaddressingdataimbalance