Self-Attentive Models for Real-Time Malware Classification

Malware classification is a critical task in cybersecurity, as it offers insights into the threats that malware poses to the victim device and helps in the design of countermeasures. For real-time malware classification, due to the high network throughputs of modern networks, there is a challenge of...

Full description

Bibliographic Details
Main Authors: Qikai Lu, Hongwen Zhang, Husam Kinawi, Di Niu
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9877977/
_version_ 1828105265511137280
author Qikai Lu
Hongwen Zhang
Husam Kinawi
Di Niu
author_facet Qikai Lu
Hongwen Zhang
Husam Kinawi
Di Niu
author_sort Qikai Lu
collection DOAJ
description Malware classification is a critical task in cybersecurity, as it offers insights into the threats that malware poses to the victim device and helps in the design of countermeasures. For real-time malware classification, due to the high network throughputs of modern networks, there is a challenge of achieving high classification accuracy while maintaining low inference latency. We first introduce two self-attention transformer-based classifiers, SeqConvAttn and ImgConvAttn, to replace the currently predominant Convolutional Neural Network (CNN) classifiers. We then devise a file-size-aware two-stage framework to combine the two proposed models, thereby controlling the tradeoff between accuracy and latency for real-time classification. To assess our proposed designs, we conduct experiments on three malware datasets: the Microsoft Malware Classification Challenge (BIG 2015) and two selected subsets from the BODMAS PE malware dataset, BODMAS-11 and BODMAS-49. We show that our transformer-based designs can achieve better classification accuracy than traditional CNN-based designs. Furthermore, we show that the proposed two-stage framework reduces the average model inference latency while maintaining superior accuracy, thereby fulfilling the requirements of real-time classification.
first_indexed 2024-04-11T10:00:37Z
format Article
id doaj.art-73f372bdc3d74abaa8d32abb8f709c6b
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-11T10:00:37Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-73f372bdc3d74abaa8d32abb8f709c6b2022-12-22T04:30:26ZengIEEEIEEE Access2169-35362022-01-0110959709598510.1109/ACCESS.2022.32029529877977Self-Attentive Models for Real-Time Malware ClassificationQikai Lu0Hongwen Zhang1Husam Kinawi2Di Niu3https://orcid.org/0000-0002-5250-7327Department of Electrical and Computer Engineering, University of Alberta, Edmonton, CanadaWedge Networks, Calgary, CanadaWedge Networks, Calgary, CanadaDepartment of Electrical and Computer Engineering, University of Alberta, Edmonton, CanadaMalware classification is a critical task in cybersecurity, as it offers insights into the threats that malware poses to the victim device and helps in the design of countermeasures. For real-time malware classification, due to the high network throughputs of modern networks, there is a challenge of achieving high classification accuracy while maintaining low inference latency. We first introduce two self-attention transformer-based classifiers, SeqConvAttn and ImgConvAttn, to replace the currently predominant Convolutional Neural Network (CNN) classifiers. We then devise a file-size-aware two-stage framework to combine the two proposed models, thereby controlling the tradeoff between accuracy and latency for real-time classification. To assess our proposed designs, we conduct experiments on three malware datasets: the Microsoft Malware Classification Challenge (BIG 2015) and two selected subsets from the BODMAS PE malware dataset, BODMAS-11 and BODMAS-49. We show that our transformer-based designs can achieve better classification accuracy than traditional CNN-based designs. Furthermore, we show that the proposed two-stage framework reduces the average model inference latency while maintaining superior accuracy, thereby fulfilling the requirements of real-time classification.https://ieeexplore.ieee.org/document/9877977/Malware classificationself-attention networksmulti-stage classificationcybersecurity
spellingShingle Qikai Lu
Hongwen Zhang
Husam Kinawi
Di Niu
Self-Attentive Models for Real-Time Malware Classification
IEEE Access
Malware classification
self-attention networks
multi-stage classification
cybersecurity
title Self-Attentive Models for Real-Time Malware Classification
title_full Self-Attentive Models for Real-Time Malware Classification
title_fullStr Self-Attentive Models for Real-Time Malware Classification
title_full_unstemmed Self-Attentive Models for Real-Time Malware Classification
title_short Self-Attentive Models for Real-Time Malware Classification
title_sort self attentive models for real time malware classification
topic Malware classification
self-attention networks
multi-stage classification
cybersecurity
url https://ieeexplore.ieee.org/document/9877977/
work_keys_str_mv AT qikailu selfattentivemodelsforrealtimemalwareclassification
AT hongwenzhang selfattentivemodelsforrealtimemalwareclassification
AT husamkinawi selfattentivemodelsforrealtimemalwareclassification
AT diniu selfattentivemodelsforrealtimemalwareclassification