RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment

With the widespread use of mobile technologies and the Internet, traffic in mobile networks is increasing. This situation has made the classification of traffic an important element for data security and network management. However, encryption of traffic in modern networks makes it difficult to clas...

Full description

Bibliographic Details
Main Authors: Murat Dener, Samed Al, Gokce Ok
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10057377/
_version_ 1811155580153233408
author Murat Dener
Samed Al
Gokce Ok
author_facet Murat Dener
Samed Al
Gokce Ok
author_sort Murat Dener
collection DOAJ
description With the widespread use of mobile technologies and the Internet, traffic in mobile networks is increasing. This situation has made the classification of traffic an important element for data security and network management. However, encryption of traffic in modern networks makes it difficult to classify traffic with traditional methods. In this study, a unique deep learning-based classification model is proposed for the classification of encrypted mobile traffic data. The proposed model is a classification model called RFSE-GRU, which combines the Gated Recurrent Units (GRU) algorithm, feature selection and data balancing. The features that are more meaningful in the classification process are determined by selecting the features with the Random Forest algorithm. In addition, Synthetic Minority Oversampling Technique (SMOTE) oversampling algorithm and Edited Nearest Neighbor (ENN) undersampling algorithm were used together to reduce the negative impact of data imbalance on classification performance. The study was carried out on Apache Spark’s big data platform in the Google Colab environment. In the study, ISCX VPN-Non VPN and UTMobileNet2021 datasets were used. Binary and multiclass classifications were made for the ISCX VPN-Non VPN dataset, and multiclass classifications were made for the UTMobileNet2021 dataset by using various algorithms on the datasets. The proposed model has been compared with eleven different algorithms and hybrid methods. At the same time, the effect of data balancing and feature selection on classification performance is examined. As a result, the proposed model achieved 93.91%, 82.68% and 96.83% accuracy rates in ISCX VPN-Non VPN binary and multiclass, UTMobileNet2021 multiclass classifications, respectively.
first_indexed 2024-04-10T04:36:56Z
format Article
id doaj.art-df1099745a1746fc9fba48899f6f6c4d
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-10T04:36:56Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-df1099745a1746fc9fba48899f6f6c4d2023-03-10T00:00:34ZengIEEEIEEE Access2169-35362023-01-0111218312184710.1109/ACCESS.2023.325174510057377RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data EnvironmentMurat Dener0https://orcid.org/0000-0001-5746-6141Samed Al1https://orcid.org/0000-0002-2208-0385Gokce Ok2Department of Information Security Engineering, Graduate School of Natural and Applied Sciences, Gazi University, Ankara, TurkeyDepartment of Information Security Engineering, Graduate School of Natural and Applied Sciences, Gazi University, Ankara, TurkeyDepartment of Information Security Engineering, Graduate School of Natural and Applied Sciences, Gazi University, Ankara, TurkeyWith the widespread use of mobile technologies and the Internet, traffic in mobile networks is increasing. This situation has made the classification of traffic an important element for data security and network management. However, encryption of traffic in modern networks makes it difficult to classify traffic with traditional methods. In this study, a unique deep learning-based classification model is proposed for the classification of encrypted mobile traffic data. The proposed model is a classification model called RFSE-GRU, which combines the Gated Recurrent Units (GRU) algorithm, feature selection and data balancing. The features that are more meaningful in the classification process are determined by selecting the features with the Random Forest algorithm. In addition, Synthetic Minority Oversampling Technique (SMOTE) oversampling algorithm and Edited Nearest Neighbor (ENN) undersampling algorithm were used together to reduce the negative impact of data imbalance on classification performance. The study was carried out on Apache Spark’s big data platform in the Google Colab environment. In the study, ISCX VPN-Non VPN and UTMobileNet2021 datasets were used. Binary and multiclass classifications were made for the ISCX VPN-Non VPN dataset, and multiclass classifications were made for the UTMobileNet2021 dataset by using various algorithms on the datasets. The proposed model has been compared with eleven different algorithms and hybrid methods. At the same time, the effect of data balancing and feature selection on classification performance is examined. As a result, the proposed model achieved 93.91%, 82.68% and 96.83% accuracy rates in ISCX VPN-Non VPN binary and multiclass, UTMobileNet2021 multiclass classifications, respectively.https://ieeexplore.ieee.org/document/10057377/Mobile encrypted trafficVPNbig datamachine learningdeep learningApache Spark
spellingShingle Murat Dener
Samed Al
Gokce Ok
RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment
IEEE Access
Mobile encrypted traffic
VPN
big data
machine learning
deep learning
Apache Spark
title RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment
title_full RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment
title_fullStr RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment
title_full_unstemmed RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment
title_short RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment
title_sort rfse gru data balanced classification model for mobile encrypted traffic in big data environment
topic Mobile encrypted traffic
VPN
big data
machine learning
deep learning
Apache Spark
url https://ieeexplore.ieee.org/document/10057377/
work_keys_str_mv AT muratdener rfsegrudatabalancedclassificationmodelformobileencryptedtrafficinbigdataenvironment
AT samedal rfsegrudatabalancedclassificationmodelformobileencryptedtrafficinbigdataenvironment
AT gokceok rfsegrudatabalancedclassificationmodelformobileencryptedtrafficinbigdataenvironment