Improved feature selection and stream traffic classification based on machine learning in software-defined networks

Traffic classification (TC) in software-defined networks (SDN) using machine learning (ML) appears to be a viable option for improving network management. TC improves SDN operability, while SDN speeds up the feature selection (FS) process, especially when ML is used as a classification mechanism to...

Full description

Bibliographic Details
Main Authors: Arwa M. Eldhai, Mosab Hamdan, Ahmed Abdelaziz, Ibrahim Abaker Targio Hashem, Sharief F. Babiker, M. N. Marsono, Muzaffar Hamzah, Noor Zaman Jhanjhi
Format: Article
Language:English
English
Published: IEEE 2024
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/41491/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/41491/2/FULL%20TEXT.pdf
_version_ 1817926543640363008
author Arwa M. Eldhai
Mosab Hamdan
Ahmed Abdelaziz
Ibrahim Abaker Targio Hashem
Sharief F. Babiker
M. N. Marsono
Muzaffar Hamzah
Noor Zaman Jhanjhi
author_facet Arwa M. Eldhai
Mosab Hamdan
Ahmed Abdelaziz
Ibrahim Abaker Targio Hashem
Sharief F. Babiker
M. N. Marsono
Muzaffar Hamzah
Noor Zaman Jhanjhi
author_sort Arwa M. Eldhai
collection UMS
description Traffic classification (TC) in software-defined networks (SDN) using machine learning (ML) appears to be a viable option for improving network management. TC improves SDN operability, while SDN speeds up the feature selection (FS) process, especially when ML is used as a classification mechanism to extract measurements and related information from incoming data to the SDN controller. Despite these advantages, there is still a lack of adequate support for TC and FS tasks due to the frequent similarity of traffic profiles, making classification difficult. Furthermore, when combined with TC, stream learning (SL) poses numerous challenges. As a result, robust statistical flow features are needed to reduce the overhead of the SDN control plane. As a result, these statistical flow features could extract online features, handle concept drift, and process an infinite data stream using limited resources (time and memory). This paper aims to improve the overall performance of TC using the SL technique to select relevant FS to alleviate load from the SDN control plane by doing the following. First, an FS mechanism called Boruta is proposed. Second, we propose three streaming-based TC methods for SDN: Hoeffding adaptive trees (HAT), adaptive random forest (ARF), and k-nearest neighbour with adaptive sliding window detector (KNN-ADWIN). These techniques can dynamically handle the concept drift and solve the problem of memory and time consumption, lowering the overhead of the SDN controller. Third, real and synthetic traffic traces are used to evaluate the proposed FS and streaming TC performance. According to simulation results, the Boruta FS technique can achieve up to 95% average accuracy and up to 87% average per application in terms of precision, recall, and f-score, outperforming other works in the literature. Furthermore, results for SL techniques show that the proposed methods can maintain up to 85% average accuracy, 78% kappa, and average rates of 62-88% in precision, recall, and f-score. In addition, when compared to ART and KNN-ADWIN, the HAT consumes less time and memory (15s and 105KB, respectively).
first_indexed 2024-12-09T00:52:43Z
format Article
id ums.eprints-41491
institution Universiti Malaysia Sabah
language English
English
last_indexed 2024-12-09T00:52:43Z
publishDate 2024
publisher IEEE
record_format dspace
spelling ums.eprints-414912024-10-18T07:54:37Z https://eprints.ums.edu.my/id/eprint/41491/ Improved feature selection and stream traffic classification based on machine learning in software-defined networks Arwa M. Eldhai Mosab Hamdan Ahmed Abdelaziz Ibrahim Abaker Targio Hashem Sharief F. Babiker M. N. Marsono Muzaffar Hamzah Noor Zaman Jhanjhi QA75.5-76.95 Electronic computers. Computer science TK7885-7895 Computer engineering. Computer hardware Traffic classification (TC) in software-defined networks (SDN) using machine learning (ML) appears to be a viable option for improving network management. TC improves SDN operability, while SDN speeds up the feature selection (FS) process, especially when ML is used as a classification mechanism to extract measurements and related information from incoming data to the SDN controller. Despite these advantages, there is still a lack of adequate support for TC and FS tasks due to the frequent similarity of traffic profiles, making classification difficult. Furthermore, when combined with TC, stream learning (SL) poses numerous challenges. As a result, robust statistical flow features are needed to reduce the overhead of the SDN control plane. As a result, these statistical flow features could extract online features, handle concept drift, and process an infinite data stream using limited resources (time and memory). This paper aims to improve the overall performance of TC using the SL technique to select relevant FS to alleviate load from the SDN control plane by doing the following. First, an FS mechanism called Boruta is proposed. Second, we propose three streaming-based TC methods for SDN: Hoeffding adaptive trees (HAT), adaptive random forest (ARF), and k-nearest neighbour with adaptive sliding window detector (KNN-ADWIN). These techniques can dynamically handle the concept drift and solve the problem of memory and time consumption, lowering the overhead of the SDN controller. Third, real and synthetic traffic traces are used to evaluate the proposed FS and streaming TC performance. According to simulation results, the Boruta FS technique can achieve up to 95% average accuracy and up to 87% average per application in terms of precision, recall, and f-score, outperforming other works in the literature. Furthermore, results for SL techniques show that the proposed methods can maintain up to 85% average accuracy, 78% kappa, and average rates of 62-88% in precision, recall, and f-score. In addition, when compared to ART and KNN-ADWIN, the HAT consumes less time and memory (15s and 105KB, respectively). IEEE 2024 Article NonPeerReviewed text en https://eprints.ums.edu.my/id/eprint/41491/1/ABSTRACT.pdf text en https://eprints.ums.edu.my/id/eprint/41491/2/FULL%20TEXT.pdf Arwa M. Eldhai and Mosab Hamdan and Ahmed Abdelaziz and Ibrahim Abaker Targio Hashem and Sharief F. Babiker and M. N. Marsono and Muzaffar Hamzah and Noor Zaman Jhanjhi (2024) Improved feature selection and stream traffic classification based on machine learning in software-defined networks. IEEE Access, 12. pp. 34141-34159. ISSN 2169-3536
spellingShingle QA75.5-76.95 Electronic computers. Computer science
TK7885-7895 Computer engineering. Computer hardware
Arwa M. Eldhai
Mosab Hamdan
Ahmed Abdelaziz
Ibrahim Abaker Targio Hashem
Sharief F. Babiker
M. N. Marsono
Muzaffar Hamzah
Noor Zaman Jhanjhi
Improved feature selection and stream traffic classification based on machine learning in software-defined networks
title Improved feature selection and stream traffic classification based on machine learning in software-defined networks
title_full Improved feature selection and stream traffic classification based on machine learning in software-defined networks
title_fullStr Improved feature selection and stream traffic classification based on machine learning in software-defined networks
title_full_unstemmed Improved feature selection and stream traffic classification based on machine learning in software-defined networks
title_short Improved feature selection and stream traffic classification based on machine learning in software-defined networks
title_sort improved feature selection and stream traffic classification based on machine learning in software defined networks
topic QA75.5-76.95 Electronic computers. Computer science
TK7885-7895 Computer engineering. Computer hardware
url https://eprints.ums.edu.my/id/eprint/41491/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/41491/2/FULL%20TEXT.pdf
work_keys_str_mv AT arwameldhai improvedfeatureselectionandstreamtrafficclassificationbasedonmachinelearninginsoftwaredefinednetworks
AT mosabhamdan improvedfeatureselectionandstreamtrafficclassificationbasedonmachinelearninginsoftwaredefinednetworks
AT ahmedabdelaziz improvedfeatureselectionandstreamtrafficclassificationbasedonmachinelearninginsoftwaredefinednetworks
AT ibrahimabakertargiohashem improvedfeatureselectionandstreamtrafficclassificationbasedonmachinelearninginsoftwaredefinednetworks
AT sharieffbabiker improvedfeatureselectionandstreamtrafficclassificationbasedonmachinelearninginsoftwaredefinednetworks
AT mnmarsono improvedfeatureselectionandstreamtrafficclassificationbasedonmachinelearninginsoftwaredefinednetworks
AT muzaffarhamzah improvedfeatureselectionandstreamtrafficclassificationbasedonmachinelearninginsoftwaredefinednetworks
AT noorzamanjhanjhi improvedfeatureselectionandstreamtrafficclassificationbasedonmachinelearninginsoftwaredefinednetworks