LTAnomaly: A Transformer Variant for Syslog Anomaly Detection Based on Multi-Scale Representation and Long Sequence Capture

Detailed information on system operation is recorded by system logs, from which fast and accurate detection of anomalies is conducive to service management and system maintenance. Log anomaly detection methods often only handle a single type of anomaly, and the utilization of log messages could be h...

Full description

Bibliographic Details
Main Authors: Delong Han, Mengjie Sun, Min Li, Qinghui Chen
Format: Article
Language:English
Published: MDPI AG 2023-06-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/13/7668
_version_ 1797592131890577408
author Delong Han
Mengjie Sun
Min Li
Qinghui Chen
author_facet Delong Han
Mengjie Sun
Min Li
Qinghui Chen
author_sort Delong Han
collection DOAJ
description Detailed information on system operation is recorded by system logs, from which fast and accurate detection of anomalies is conducive to service management and system maintenance. Log anomaly detection methods often only handle a single type of anomaly, and the utilization of log messages could be higher, which makes it challenging to improve the performance of log anomaly detection models. This article presents the LTAnomaly model to accomplish log anomaly detection using semantic information, sequence relationships, and component values to make a vector representation of logs, and we add Transformer with long short-term memory (LSTM) as our final classification model. When sequences are processed sequentially, the model is also influenced by the information from the global information, thus increasing the dependence on feature information. This improves the utilization of log messages with a flexible, simple, and robust model. To evaluate the effectiveness of our method, experiments are performed on the HDFS and BGL datasets, with the F1-measures reaching 0.985 and 0.975, respectively, showing that the proposed method enjoys higher accuracy and a more comprehensive application range than existing models.
first_indexed 2024-03-11T01:47:07Z
format Article
id doaj.art-739ce4fb55c943208029bb8a6ea46940
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T01:47:07Z
publishDate 2023-06-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-739ce4fb55c943208029bb8a6ea469402023-11-18T16:09:45ZengMDPI AGApplied Sciences2076-34172023-06-011313766810.3390/app13137668LTAnomaly: A Transformer Variant for Syslog Anomaly Detection Based on Multi-Scale Representation and Long Sequence CaptureDelong Han0Mengjie Sun1Min Li2Qinghui Chen3Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, ChinaKey Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, ChinaKey Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, ChinaKey Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, ChinaDetailed information on system operation is recorded by system logs, from which fast and accurate detection of anomalies is conducive to service management and system maintenance. Log anomaly detection methods often only handle a single type of anomaly, and the utilization of log messages could be higher, which makes it challenging to improve the performance of log anomaly detection models. This article presents the LTAnomaly model to accomplish log anomaly detection using semantic information, sequence relationships, and component values to make a vector representation of logs, and we add Transformer with long short-term memory (LSTM) as our final classification model. When sequences are processed sequentially, the model is also influenced by the information from the global information, thus increasing the dependence on feature information. This improves the utilization of log messages with a flexible, simple, and robust model. To evaluate the effectiveness of our method, experiments are performed on the HDFS and BGL datasets, with the F1-measures reaching 0.985 and 0.975, respectively, showing that the proposed method enjoys higher accuracy and a more comprehensive application range than existing models.https://www.mdpi.com/2076-3417/13/13/7668anomaly detectiondeep learninglog analysis
spellingShingle Delong Han
Mengjie Sun
Min Li
Qinghui Chen
LTAnomaly: A Transformer Variant for Syslog Anomaly Detection Based on Multi-Scale Representation and Long Sequence Capture
Applied Sciences
anomaly detection
deep learning
log analysis
title LTAnomaly: A Transformer Variant for Syslog Anomaly Detection Based on Multi-Scale Representation and Long Sequence Capture
title_full LTAnomaly: A Transformer Variant for Syslog Anomaly Detection Based on Multi-Scale Representation and Long Sequence Capture
title_fullStr LTAnomaly: A Transformer Variant for Syslog Anomaly Detection Based on Multi-Scale Representation and Long Sequence Capture
title_full_unstemmed LTAnomaly: A Transformer Variant for Syslog Anomaly Detection Based on Multi-Scale Representation and Long Sequence Capture
title_short LTAnomaly: A Transformer Variant for Syslog Anomaly Detection Based on Multi-Scale Representation and Long Sequence Capture
title_sort ltanomaly a transformer variant for syslog anomaly detection based on multi scale representation and long sequence capture
topic anomaly detection
deep learning
log analysis
url https://www.mdpi.com/2076-3417/13/13/7668
work_keys_str_mv AT delonghan ltanomalyatransformervariantforsysloganomalydetectionbasedonmultiscalerepresentationandlongsequencecapture
AT mengjiesun ltanomalyatransformervariantforsysloganomalydetectionbasedonmultiscalerepresentationandlongsequencecapture
AT minli ltanomalyatransformervariantforsysloganomalydetectionbasedonmultiscalerepresentationandlongsequencecapture
AT qinghuichen ltanomalyatransformervariantforsysloganomalydetectionbasedonmultiscalerepresentationandlongsequencecapture