An Online Parsing Framework for Semistructured Streaming System Logs of Internet of Things Systems

This article presents a novel log abstraction framework based on neural open information extraction (OpenIE) and dynamic word embedding principles. Though various log parsing frameworks are proposed in the literature, the existing frameworks are modeled on predefined heuristics or auto-regressive me...

Full description

Bibliographic Details
Main Authors: Susnata Bhattacharya, Biplob Ray, Ritesh Chugh, Steven Gordon
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Open Journal of Instrumentation and Measurement
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10004508/
_version_ 1797197416746713088
author Susnata Bhattacharya
Biplob Ray
Ritesh Chugh
Steven Gordon
author_facet Susnata Bhattacharya
Biplob Ray
Ritesh Chugh
Steven Gordon
author_sort Susnata Bhattacharya
collection DOAJ
description This article presents a novel log abstraction framework based on neural open information extraction (OpenIE) and dynamic word embedding principles. Though various log parsing frameworks are proposed in the literature, the existing frameworks are modeled on predefined heuristics or auto-regressive methodologies that work well in offline scenarios. However, these frameworks are less suitable for dynamic self-adaptive systems, such as the Internet of Things (IoT), where the log outputs have diverse contextual variations and disparate time irregularities. Therefore, it is essential to move away from these traditional approaches and develop a systematic model that can effectively analyze log outputs in real-time and increase the system up-time of IoT networks so that they are almost always available. To address these needs, the proposed framework used OpenIE along with term frequency/inverse document frequency (TF/IDF) vectorization for constructing a set of relational triples (aka triple-sets). Additionally, a dynamic pretrained encoder–decoder architecture is utilized to imbibe the positional and contextualized information in its resultant outputs. The adopted methodology has enabled the proposed framework to extract richer word representations with dynamic contextualization of time-sensitive event logs to enhance further downstream activities, such as failure prediction and prognostic analysis of IoT networks. The proposed framework is evaluated on the system event log traces accumulated from a long range wide-area network (LoRaWAN) IoT gateway to proactively determine the probable causes of its various failure scenarios. Additionally, the study also provided a comparative analysis of its mathematical representations with that of the current state-of-the-art (SOTA) approaches to project the advantages and benefits of the proposed model, particularly from its data analytics standpoint.
first_indexed 2024-04-24T06:43:37Z
format Article
id doaj.art-bf7741de78644882ada23d47de2b823a
institution Directory Open Access Journal
issn 2768-7236
language English
last_indexed 2024-04-24T06:43:37Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Open Journal of Instrumentation and Measurement
spelling doaj.art-bf7741de78644882ada23d47de2b823a2024-04-22T20:23:39ZengIEEEIEEE Open Journal of Instrumentation and Measurement2768-72362023-01-01211810.1109/OJIM.2022.323265010004508An Online Parsing Framework for Semistructured Streaming System Logs of Internet of Things SystemsSusnata Bhattacharya0https://orcid.org/0000-0002-8189-8435Biplob Ray1https://orcid.org/0000-0002-3016-1695Ritesh Chugh2https://orcid.org/0000-0003-0061-7206Steven Gordon3https://orcid.org/0000-0003-4090-1199School of Engineering and Technology, Central Queensland University, Rockhampton, QLD, AustraliaSchool of Engineering and Technology, Central Queensland University, Rockhampton, QLD, AustraliaSchool of Engineering and Technology, Central Queensland University, Rockhampton, QLD, AustraliaSchool of Engineering and Technology, Central Queensland University, Rockhampton, QLD, AustraliaThis article presents a novel log abstraction framework based on neural open information extraction (OpenIE) and dynamic word embedding principles. Though various log parsing frameworks are proposed in the literature, the existing frameworks are modeled on predefined heuristics or auto-regressive methodologies that work well in offline scenarios. However, these frameworks are less suitable for dynamic self-adaptive systems, such as the Internet of Things (IoT), where the log outputs have diverse contextual variations and disparate time irregularities. Therefore, it is essential to move away from these traditional approaches and develop a systematic model that can effectively analyze log outputs in real-time and increase the system up-time of IoT networks so that they are almost always available. To address these needs, the proposed framework used OpenIE along with term frequency/inverse document frequency (TF/IDF) vectorization for constructing a set of relational triples (aka triple-sets). Additionally, a dynamic pretrained encoder–decoder architecture is utilized to imbibe the positional and contextualized information in its resultant outputs. The adopted methodology has enabled the proposed framework to extract richer word representations with dynamic contextualization of time-sensitive event logs to enhance further downstream activities, such as failure prediction and prognostic analysis of IoT networks. The proposed framework is evaluated on the system event log traces accumulated from a long range wide-area network (LoRaWAN) IoT gateway to proactively determine the probable causes of its various failure scenarios. Additionally, the study also provided a comparative analysis of its mathematical representations with that of the current state-of-the-art (SOTA) approaches to project the advantages and benefits of the proposed model, particularly from its data analytics standpoint.https://ieeexplore.ieee.org/document/10004508/BERTcontext-awaregatewayinformation extractionInternet of Things (IoT)log parsing
spellingShingle Susnata Bhattacharya
Biplob Ray
Ritesh Chugh
Steven Gordon
An Online Parsing Framework for Semistructured Streaming System Logs of Internet of Things Systems
IEEE Open Journal of Instrumentation and Measurement
BERT
context-aware
gateway
information extraction
Internet of Things (IoT)
log parsing
title An Online Parsing Framework for Semistructured Streaming System Logs of Internet of Things Systems
title_full An Online Parsing Framework for Semistructured Streaming System Logs of Internet of Things Systems
title_fullStr An Online Parsing Framework for Semistructured Streaming System Logs of Internet of Things Systems
title_full_unstemmed An Online Parsing Framework for Semistructured Streaming System Logs of Internet of Things Systems
title_short An Online Parsing Framework for Semistructured Streaming System Logs of Internet of Things Systems
title_sort online parsing framework for semistructured streaming system logs of internet of things systems
topic BERT
context-aware
gateway
information extraction
Internet of Things (IoT)
log parsing
url https://ieeexplore.ieee.org/document/10004508/
work_keys_str_mv AT susnatabhattacharya anonlineparsingframeworkforsemistructuredstreamingsystemlogsofinternetofthingssystems
AT biplobray anonlineparsingframeworkforsemistructuredstreamingsystemlogsofinternetofthingssystems
AT riteshchugh anonlineparsingframeworkforsemistructuredstreamingsystemlogsofinternetofthingssystems
AT stevengordon anonlineparsingframeworkforsemistructuredstreamingsystemlogsofinternetofthingssystems
AT susnatabhattacharya onlineparsingframeworkforsemistructuredstreamingsystemlogsofinternetofthingssystems
AT biplobray onlineparsingframeworkforsemistructuredstreamingsystemlogsofinternetofthingssystems
AT riteshchugh onlineparsingframeworkforsemistructuredstreamingsystemlogsofinternetofthingssystems
AT stevengordon onlineparsingframeworkforsemistructuredstreamingsystemlogsofinternetofthingssystems