Machine Learning Based Classification of IoT Traffic

With the rapid expansion and widespread adoption of the Internet of Things (IoT), maintaining secure connections among active devices can be challenging. Since IoT devices are limited in power and storage, they cannot perform complex tasks, which makes them vulnerable to different types of attacks....

Full description

Bibliographic Details
Main Authors:	B. Velichkovska, A. Cholakoska, V. Atanasovski
Format:	Article
Language:	English
Published:	Spolecnost pro radioelektronicke inzenyrstvi 2023-06-01
Series:	Radioengineering
Subjects:	machine learning deep learning internet of things (iot) intrusion detection traffic modelling
Online Access:	https://www.radioeng.cz/fulltexts/2023/23_02_0256_0263.pdf

_version_	1827936976326623232
author	B. Velichkovska A. Cholakoska V. Atanasovski
author_facet	B. Velichkovska A. Cholakoska V. Atanasovski
author_sort	B. Velichkovska
collection	DOAJ
description	With the rapid expansion and widespread adoption of the Internet of Things (IoT), maintaining secure connections among active devices can be challenging. Since IoT devices are limited in power and storage, they cannot perform complex tasks, which makes them vulnerable to different types of attacks. Given the volume of data generated daily, detecting anomalous behavior can be demanding. However, machine learning (ML) algorithms have proven successful in extracting complex patterns from big data, which has led to active applications in IoT. In this paper, we perform a comprehensive analysis, including 4 ML algorithms and 3 neural networks (NNs), and propose a pipeline which analyzes the influence data reduction (loss) has on the performance of these algorithms. We use random undersampling as a data reduction technique, which simulates reduced network traffic data. The pipeline investigates several degrees of data loss. The results show that models trained on the original data distribution obtain accuracy that verges on 100%. XGBoost performs best from the classic ML algorithms. From the deep learning models, the 2-layered NN provides excellent results and has sufficient depth for practical application. On the other hand, when the models are trained on the undersampled data, there is a decrease in performance, most notably in the case of NNs. The most prominent change is seen in the 4-layered NN, where the model trained on the original dataset detects attacks with a success of 93.53%, whereas the model trained on the maximally reduced data has a success of only 39.39%.
first_indexed	2024-03-13T08:17:55Z
format	Article
id	doaj.art-9bdde34b085a447583e527f79c21cc5c
institution	Directory Open Access Journal
issn	1210-2512
language	English
last_indexed	2024-03-13T08:17:55Z
publishDate	2023-06-01
publisher	Spolecnost pro radioelektronicke inzenyrstvi
record_format	Article
series	Radioengineering
spelling	doaj.art-9bdde34b085a447583e527f79c21cc5c2023-05-31T12:58:06ZengSpolecnost pro radioelektronicke inzenyrstviRadioengineering1210-25122023-06-01322256263Machine Learning Based Classification of IoT TrafficB. VelichkovskaA. CholakoskaV. AtanasovskiWith the rapid expansion and widespread adoption of the Internet of Things (IoT), maintaining secure connections among active devices can be challenging. Since IoT devices are limited in power and storage, they cannot perform complex tasks, which makes them vulnerable to different types of attacks. Given the volume of data generated daily, detecting anomalous behavior can be demanding. However, machine learning (ML) algorithms have proven successful in extracting complex patterns from big data, which has led to active applications in IoT. In this paper, we perform a comprehensive analysis, including 4 ML algorithms and 3 neural networks (NNs), and propose a pipeline which analyzes the influence data reduction (loss) has on the performance of these algorithms. We use random undersampling as a data reduction technique, which simulates reduced network traffic data. The pipeline investigates several degrees of data loss. The results show that models trained on the original data distribution obtain accuracy that verges on 100%. XGBoost performs best from the classic ML algorithms. From the deep learning models, the 2-layered NN provides excellent results and has sufficient depth for practical application. On the other hand, when the models are trained on the undersampled data, there is a decrease in performance, most notably in the case of NNs. The most prominent change is seen in the 4-layered NN, where the model trained on the original dataset detects attacks with a success of 93.53%, whereas the model trained on the maximally reduced data has a success of only 39.39%.https://www.radioeng.cz/fulltexts/2023/23_02_0256_0263.pdfmachine learningdeep learninginternet of things (iot)intrusion detectiontraffic modelling
spellingShingle	B. Velichkovska A. Cholakoska V. Atanasovski Machine Learning Based Classification of IoT Traffic Radioengineering machine learning deep learning internet of things (iot) intrusion detection traffic modelling
title	Machine Learning Based Classification of IoT Traffic
title_full	Machine Learning Based Classification of IoT Traffic
title_fullStr	Machine Learning Based Classification of IoT Traffic
title_full_unstemmed	Machine Learning Based Classification of IoT Traffic
title_short	Machine Learning Based Classification of IoT Traffic
title_sort	machine learning based classification of iot traffic
topic	machine learning deep learning internet of things (iot) intrusion detection traffic modelling
url	https://www.radioeng.cz/fulltexts/2023/23_02_0256_0263.pdf
work_keys_str_mv	AT bvelichkovska machinelearningbasedclassificationofiottraffic AT acholakoska machinelearningbasedclassificationofiottraffic AT vatanasovski machinelearningbasedclassificationofiottraffic

Machine Learning Based Classification of IoT Traffic

Similar Items