Network Intrusion Detection Based on Feature Image and Deformable Vision Transformer Classification

Network intrusion detection technology has always been an indispensable protection mechanism for industrial network security. The rise of new forms of network attacks has resulted in a heightened demand for these technologies. Nevertheless, the current models’ effectiveness is subpar. We...

Full description

Bibliographic Details
Main Authors: Kan He, Wei Zhang, Xuejun Zong, Lian Lian
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10468586/
_version_ 1827303765401665536
author Kan He
Wei Zhang
Xuejun Zong
Lian Lian
author_facet Kan He
Wei Zhang
Xuejun Zong
Lian Lian
author_sort Kan He
collection DOAJ
description Network intrusion detection technology has always been an indispensable protection mechanism for industrial network security. The rise of new forms of network attacks has resulted in a heightened demand for these technologies. Nevertheless, the current models’ effectiveness is subpar. We propose a new Deformable Vision Transformer (DE-VIT) method to address this issue. DE-VIT introduces a new deformable attention mechanism module, where the positions of key-value pairs in the attention mechanism are selected in a data-dependent manner, allowing it to focus on relevant areas, capture more informative features, and avoid excessive memory and computational costs. In addition to using deformable convolutions instead of regular convolutions in embedding layers to enhance the receptive field of patches, a sliding window mechanism is also employed to utilize edge information fully. In Parallel, we use a layered focal loss function to improve classification performance and address data imbalance issues. In summary, DE-VIT reduces computational complexity and achieves better results. We conduct experimental simulations on the public intrusion detection datasets, and the accuracy of the enhanced intrusion detection model surpasses that of the Deep Belief Network with Improved Kernel-Based Extreme Learning (DBN-KELM). It reaches 99.5% and 97.5% on the CIC IDS2017 and UNSW-NB15 datasets, exhibiting an increase of 8.5% and 9.1%, respectively.
first_indexed 2024-04-24T17:06:42Z
format Article
id doaj.art-e901a45c42e44526b8a4b5cc31a67626
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-24T17:06:42Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-e901a45c42e44526b8a4b5cc31a676262024-03-28T23:00:34ZengIEEEIEEE Access2169-35362024-01-0112443354435010.1109/ACCESS.2024.337643410468586Network Intrusion Detection Based on Feature Image and Deformable Vision Transformer ClassificationKan He0Wei Zhang1https://orcid.org/0009-0001-1412-8094Xuejun Zong2https://orcid.org/0009-0000-0084-2775Lian Lian3College of Information Engineering, Shenyang University of Chemical Technology, Shenyang, ChinaCollege of Information Engineering, Shenyang University of Chemical Technology, Shenyang, ChinaCollege of Information Engineering, Shenyang University of Chemical Technology, Shenyang, ChinaCollege of Information Engineering, Shenyang University of Chemical Technology, Shenyang, ChinaNetwork intrusion detection technology has always been an indispensable protection mechanism for industrial network security. The rise of new forms of network attacks has resulted in a heightened demand for these technologies. Nevertheless, the current models’ effectiveness is subpar. We propose a new Deformable Vision Transformer (DE-VIT) method to address this issue. DE-VIT introduces a new deformable attention mechanism module, where the positions of key-value pairs in the attention mechanism are selected in a data-dependent manner, allowing it to focus on relevant areas, capture more informative features, and avoid excessive memory and computational costs. In addition to using deformable convolutions instead of regular convolutions in embedding layers to enhance the receptive field of patches, a sliding window mechanism is also employed to utilize edge information fully. In Parallel, we use a layered focal loss function to improve classification performance and address data imbalance issues. In summary, DE-VIT reduces computational complexity and achieves better results. We conduct experimental simulations on the public intrusion detection datasets, and the accuracy of the enhanced intrusion detection model surpasses that of the Deep Belief Network with Improved Kernel-Based Extreme Learning (DBN-KELM). It reaches 99.5% and 97.5% on the CIC IDS2017 and UNSW-NB15 datasets, exhibiting an increase of 8.5% and 9.1%, respectively.https://ieeexplore.ieee.org/document/10468586/Network intrusion detectiondeformable vision transformerdeformable convolutiondeformable attention mechanismvision transformer
spellingShingle Kan He
Wei Zhang
Xuejun Zong
Lian Lian
Network Intrusion Detection Based on Feature Image and Deformable Vision Transformer Classification
IEEE Access
Network intrusion detection
deformable vision transformer
deformable convolution
deformable attention mechanism
vision transformer
title Network Intrusion Detection Based on Feature Image and Deformable Vision Transformer Classification
title_full Network Intrusion Detection Based on Feature Image and Deformable Vision Transformer Classification
title_fullStr Network Intrusion Detection Based on Feature Image and Deformable Vision Transformer Classification
title_full_unstemmed Network Intrusion Detection Based on Feature Image and Deformable Vision Transformer Classification
title_short Network Intrusion Detection Based on Feature Image and Deformable Vision Transformer Classification
title_sort network intrusion detection based on feature image and deformable vision transformer classification
topic Network intrusion detection
deformable vision transformer
deformable convolution
deformable attention mechanism
vision transformer
url https://ieeexplore.ieee.org/document/10468586/
work_keys_str_mv AT kanhe networkintrusiondetectionbasedonfeatureimageanddeformablevisiontransformerclassification
AT weizhang networkintrusiondetectionbasedonfeatureimageanddeformablevisiontransformerclassification
AT xuejunzong networkintrusiondetectionbasedonfeatureimageanddeformablevisiontransformerclassification
AT lianlian networkintrusiondetectionbasedonfeatureimageanddeformablevisiontransformerclassification