RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment
With the widespread use of mobile technologies and the Internet, traffic in mobile networks is increasing. This situation has made the classification of traffic an important element for data security and network management. However, encryption of traffic in modern networks makes it difficult to clas...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2023-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10057377/ |
_version_ | 1811155580153233408 |
---|---|
author | Murat Dener Samed Al Gokce Ok |
author_facet | Murat Dener Samed Al Gokce Ok |
author_sort | Murat Dener |
collection | DOAJ |
description | With the widespread use of mobile technologies and the Internet, traffic in mobile networks is increasing. This situation has made the classification of traffic an important element for data security and network management. However, encryption of traffic in modern networks makes it difficult to classify traffic with traditional methods. In this study, a unique deep learning-based classification model is proposed for the classification of encrypted mobile traffic data. The proposed model is a classification model called RFSE-GRU, which combines the Gated Recurrent Units (GRU) algorithm, feature selection and data balancing. The features that are more meaningful in the classification process are determined by selecting the features with the Random Forest algorithm. In addition, Synthetic Minority Oversampling Technique (SMOTE) oversampling algorithm and Edited Nearest Neighbor (ENN) undersampling algorithm were used together to reduce the negative impact of data imbalance on classification performance. The study was carried out on Apache Spark’s big data platform in the Google Colab environment. In the study, ISCX VPN-Non VPN and UTMobileNet2021 datasets were used. Binary and multiclass classifications were made for the ISCX VPN-Non VPN dataset, and multiclass classifications were made for the UTMobileNet2021 dataset by using various algorithms on the datasets. The proposed model has been compared with eleven different algorithms and hybrid methods. At the same time, the effect of data balancing and feature selection on classification performance is examined. As a result, the proposed model achieved 93.91%, 82.68% and 96.83% accuracy rates in ISCX VPN-Non VPN binary and multiclass, UTMobileNet2021 multiclass classifications, respectively. |
first_indexed | 2024-04-10T04:36:56Z |
format | Article |
id | doaj.art-df1099745a1746fc9fba48899f6f6c4d |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-04-10T04:36:56Z |
publishDate | 2023-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-df1099745a1746fc9fba48899f6f6c4d2023-03-10T00:00:34ZengIEEEIEEE Access2169-35362023-01-0111218312184710.1109/ACCESS.2023.325174510057377RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data EnvironmentMurat Dener0https://orcid.org/0000-0001-5746-6141Samed Al1https://orcid.org/0000-0002-2208-0385Gokce Ok2Department of Information Security Engineering, Graduate School of Natural and Applied Sciences, Gazi University, Ankara, TurkeyDepartment of Information Security Engineering, Graduate School of Natural and Applied Sciences, Gazi University, Ankara, TurkeyDepartment of Information Security Engineering, Graduate School of Natural and Applied Sciences, Gazi University, Ankara, TurkeyWith the widespread use of mobile technologies and the Internet, traffic in mobile networks is increasing. This situation has made the classification of traffic an important element for data security and network management. However, encryption of traffic in modern networks makes it difficult to classify traffic with traditional methods. In this study, a unique deep learning-based classification model is proposed for the classification of encrypted mobile traffic data. The proposed model is a classification model called RFSE-GRU, which combines the Gated Recurrent Units (GRU) algorithm, feature selection and data balancing. The features that are more meaningful in the classification process are determined by selecting the features with the Random Forest algorithm. In addition, Synthetic Minority Oversampling Technique (SMOTE) oversampling algorithm and Edited Nearest Neighbor (ENN) undersampling algorithm were used together to reduce the negative impact of data imbalance on classification performance. The study was carried out on Apache Spark’s big data platform in the Google Colab environment. In the study, ISCX VPN-Non VPN and UTMobileNet2021 datasets were used. Binary and multiclass classifications were made for the ISCX VPN-Non VPN dataset, and multiclass classifications were made for the UTMobileNet2021 dataset by using various algorithms on the datasets. The proposed model has been compared with eleven different algorithms and hybrid methods. At the same time, the effect of data balancing and feature selection on classification performance is examined. As a result, the proposed model achieved 93.91%, 82.68% and 96.83% accuracy rates in ISCX VPN-Non VPN binary and multiclass, UTMobileNet2021 multiclass classifications, respectively.https://ieeexplore.ieee.org/document/10057377/Mobile encrypted trafficVPNbig datamachine learningdeep learningApache Spark |
spellingShingle | Murat Dener Samed Al Gokce Ok RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment IEEE Access Mobile encrypted traffic VPN big data machine learning deep learning Apache Spark |
title | RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment |
title_full | RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment |
title_fullStr | RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment |
title_full_unstemmed | RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment |
title_short | RFSE-GRU: Data Balanced Classification Model for Mobile Encrypted Traffic in Big Data Environment |
title_sort | rfse gru data balanced classification model for mobile encrypted traffic in big data environment |
topic | Mobile encrypted traffic VPN big data machine learning deep learning Apache Spark |
url | https://ieeexplore.ieee.org/document/10057377/ |
work_keys_str_mv | AT muratdener rfsegrudatabalancedclassificationmodelformobileencryptedtrafficinbigdataenvironment AT samedal rfsegrudatabalancedclassificationmodelformobileencryptedtrafficinbigdataenvironment AT gokceok rfsegrudatabalancedclassificationmodelformobileencryptedtrafficinbigdataenvironment |