Machine Learning Based Classification of IoT Traffic

With the rapid expansion and widespread adoption of the Internet of Things (IoT), maintaining secure connections among active devices can be challenging. Since IoT devices are limited in power and storage, they cannot perform complex tasks, which makes them vulnerable to different types of attacks....

Full description

Bibliographic Details
Main Authors: B. Velichkovska, A. Cholakoska, V. Atanasovski
Format: Article
Language:English
Published: Spolecnost pro radioelektronicke inzenyrstvi 2023-06-01
Series:Radioengineering
Subjects:
Online Access:https://www.radioeng.cz/fulltexts/2023/23_02_0256_0263.pdf
_version_ 1797815116137234432
author B. Velichkovska
A. Cholakoska
V. Atanasovski
author_facet B. Velichkovska
A. Cholakoska
V. Atanasovski
author_sort B. Velichkovska
collection DOAJ
description With the rapid expansion and widespread adoption of the Internet of Things (IoT), maintaining secure connections among active devices can be challenging. Since IoT devices are limited in power and storage, they cannot perform complex tasks, which makes them vulnerable to different types of attacks. Given the volume of data generated daily, detecting anomalous behavior can be demanding. However, machine learning (ML) algorithms have proven successful in extracting complex patterns from big data, which has led to active applications in IoT. In this paper, we perform a comprehensive analysis, including 4 ML algorithms and 3 neural networks (NNs), and propose a pipeline which analyzes the influence data reduction (loss) has on the performance of these algorithms. We use random undersampling as a data reduction technique, which simulates reduced network traffic data. The pipeline investigates several degrees of data loss. The results show that models trained on the original data distribution obtain accuracy that verges on 100%. XGBoost performs best from the classic ML algorithms. From the deep learning models, the 2-layered NN provides excellent results and has sufficient depth for practical application. On the other hand, when the models are trained on the undersampled data, there is a decrease in performance, most notably in the case of NNs. The most prominent change is seen in the 4-layered NN, where the model trained on the original dataset detects attacks with a success of 93.53%, whereas the model trained on the maximally reduced data has a success of only 39.39%.
first_indexed 2024-03-13T08:17:55Z
format Article
id doaj.art-9bdde34b085a447583e527f79c21cc5c
institution Directory Open Access Journal
issn 1210-2512
language English
last_indexed 2024-03-13T08:17:55Z
publishDate 2023-06-01
publisher Spolecnost pro radioelektronicke inzenyrstvi
record_format Article
series Radioengineering
spelling doaj.art-9bdde34b085a447583e527f79c21cc5c2023-05-31T12:58:06ZengSpolecnost pro radioelektronicke inzenyrstviRadioengineering1210-25122023-06-01322256263Machine Learning Based Classification of IoT TrafficB. VelichkovskaA. CholakoskaV. AtanasovskiWith the rapid expansion and widespread adoption of the Internet of Things (IoT), maintaining secure connections among active devices can be challenging. Since IoT devices are limited in power and storage, they cannot perform complex tasks, which makes them vulnerable to different types of attacks. Given the volume of data generated daily, detecting anomalous behavior can be demanding. However, machine learning (ML) algorithms have proven successful in extracting complex patterns from big data, which has led to active applications in IoT. In this paper, we perform a comprehensive analysis, including 4 ML algorithms and 3 neural networks (NNs), and propose a pipeline which analyzes the influence data reduction (loss) has on the performance of these algorithms. We use random undersampling as a data reduction technique, which simulates reduced network traffic data. The pipeline investigates several degrees of data loss. The results show that models trained on the original data distribution obtain accuracy that verges on 100%. XGBoost performs best from the classic ML algorithms. From the deep learning models, the 2-layered NN provides excellent results and has sufficient depth for practical application. On the other hand, when the models are trained on the undersampled data, there is a decrease in performance, most notably in the case of NNs. The most prominent change is seen in the 4-layered NN, where the model trained on the original dataset detects attacks with a success of 93.53%, whereas the model trained on the maximally reduced data has a success of only 39.39%.https://www.radioeng.cz/fulltexts/2023/23_02_0256_0263.pdfmachine learningdeep learninginternet of things (iot)intrusion detectiontraffic modelling
spellingShingle B. Velichkovska
A. Cholakoska
V. Atanasovski
Machine Learning Based Classification of IoT Traffic
Radioengineering
machine learning
deep learning
internet of things (iot)
intrusion detection
traffic modelling
title Machine Learning Based Classification of IoT Traffic
title_full Machine Learning Based Classification of IoT Traffic
title_fullStr Machine Learning Based Classification of IoT Traffic
title_full_unstemmed Machine Learning Based Classification of IoT Traffic
title_short Machine Learning Based Classification of IoT Traffic
title_sort machine learning based classification of iot traffic
topic machine learning
deep learning
internet of things (iot)
intrusion detection
traffic modelling
url https://www.radioeng.cz/fulltexts/2023/23_02_0256_0263.pdf
work_keys_str_mv AT bvelichkovska machinelearningbasedclassificationofiottraffic
AT acholakoska machinelearningbasedclassificationofiottraffic
AT vatanasovski machinelearningbasedclassificationofiottraffic