Machine Learning Based Classification of IoT Traffic
With the rapid expansion and widespread adoption of the Internet of Things (IoT), maintaining secure connections among active devices can be challenging. Since IoT devices are limited in power and storage, they cannot perform complex tasks, which makes them vulnerable to different types of attacks....
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Spolecnost pro radioelektronicke inzenyrstvi
2023-06-01
|
Series: | Radioengineering |
Subjects: | |
Online Access: | https://www.radioeng.cz/fulltexts/2023/23_02_0256_0263.pdf |
_version_ | 1797815116137234432 |
---|---|
author | B. Velichkovska A. Cholakoska V. Atanasovski |
author_facet | B. Velichkovska A. Cholakoska V. Atanasovski |
author_sort | B. Velichkovska |
collection | DOAJ |
description | With the rapid expansion and widespread adoption of the Internet of Things (IoT), maintaining secure connections among active devices can be challenging. Since IoT devices are limited in power and storage, they cannot perform complex tasks, which makes them vulnerable to different types of attacks. Given the volume of data generated daily, detecting anomalous behavior can be demanding. However, machine learning (ML) algorithms have proven successful in extracting complex patterns from big data, which has led to active applications in IoT. In this paper, we perform a comprehensive analysis, including 4 ML algorithms and 3 neural networks (NNs), and propose a pipeline which analyzes the influence data reduction (loss) has on the performance of these algorithms. We use random undersampling as a data reduction technique, which simulates reduced network traffic data. The pipeline investigates several degrees of data loss. The results show that models trained on the original data distribution obtain accuracy that verges on 100%. XGBoost performs best from the classic ML algorithms. From the deep learning models, the 2-layered NN provides excellent results and has sufficient depth for practical application. On the other hand, when the models are trained on the undersampled data, there is a decrease in performance, most notably in the case of NNs. The most prominent change is seen in the 4-layered NN, where the model trained on the original dataset detects attacks with a success of 93.53%, whereas the model trained on the maximally reduced data has a success of only 39.39%. |
first_indexed | 2024-03-13T08:17:55Z |
format | Article |
id | doaj.art-9bdde34b085a447583e527f79c21cc5c |
institution | Directory Open Access Journal |
issn | 1210-2512 |
language | English |
last_indexed | 2024-03-13T08:17:55Z |
publishDate | 2023-06-01 |
publisher | Spolecnost pro radioelektronicke inzenyrstvi |
record_format | Article |
series | Radioengineering |
spelling | doaj.art-9bdde34b085a447583e527f79c21cc5c2023-05-31T12:58:06ZengSpolecnost pro radioelektronicke inzenyrstviRadioengineering1210-25122023-06-01322256263Machine Learning Based Classification of IoT TrafficB. VelichkovskaA. CholakoskaV. AtanasovskiWith the rapid expansion and widespread adoption of the Internet of Things (IoT), maintaining secure connections among active devices can be challenging. Since IoT devices are limited in power and storage, they cannot perform complex tasks, which makes them vulnerable to different types of attacks. Given the volume of data generated daily, detecting anomalous behavior can be demanding. However, machine learning (ML) algorithms have proven successful in extracting complex patterns from big data, which has led to active applications in IoT. In this paper, we perform a comprehensive analysis, including 4 ML algorithms and 3 neural networks (NNs), and propose a pipeline which analyzes the influence data reduction (loss) has on the performance of these algorithms. We use random undersampling as a data reduction technique, which simulates reduced network traffic data. The pipeline investigates several degrees of data loss. The results show that models trained on the original data distribution obtain accuracy that verges on 100%. XGBoost performs best from the classic ML algorithms. From the deep learning models, the 2-layered NN provides excellent results and has sufficient depth for practical application. On the other hand, when the models are trained on the undersampled data, there is a decrease in performance, most notably in the case of NNs. The most prominent change is seen in the 4-layered NN, where the model trained on the original dataset detects attacks with a success of 93.53%, whereas the model trained on the maximally reduced data has a success of only 39.39%.https://www.radioeng.cz/fulltexts/2023/23_02_0256_0263.pdfmachine learningdeep learninginternet of things (iot)intrusion detectiontraffic modelling |
spellingShingle | B. Velichkovska A. Cholakoska V. Atanasovski Machine Learning Based Classification of IoT Traffic Radioengineering machine learning deep learning internet of things (iot) intrusion detection traffic modelling |
title | Machine Learning Based Classification of IoT Traffic |
title_full | Machine Learning Based Classification of IoT Traffic |
title_fullStr | Machine Learning Based Classification of IoT Traffic |
title_full_unstemmed | Machine Learning Based Classification of IoT Traffic |
title_short | Machine Learning Based Classification of IoT Traffic |
title_sort | machine learning based classification of iot traffic |
topic | machine learning deep learning internet of things (iot) intrusion detection traffic modelling |
url | https://www.radioeng.cz/fulltexts/2023/23_02_0256_0263.pdf |
work_keys_str_mv | AT bvelichkovska machinelearningbasedclassificationofiottraffic AT acholakoska machinelearningbasedclassificationofiottraffic AT vatanasovski machinelearningbasedclassificationofiottraffic |