Log mining and knowledge-based models in data storage systems diagnostics

Modern data storage systems have a sophisticated hardware and software architecture, including multiple storage processors, storage fabrics, network equipment and storage media and contain information, which can be damaged or lost because of hardware or software fault. Approach to storage software d...

Full description

Bibliographic Details
Main Author: Uspenskij Mikhail B.
Format: Article
Language:English
Published: EDP Sciences 2019-01-01
Series:E3S Web of Conferences
Online Access:https://www.e3s-conferences.org/articles/e3sconf/pdf/2019/66/e3sconf_eece18_03006.pdf
_version_ 1819173973395505152
author Uspenskij Mikhail B.
author_facet Uspenskij Mikhail B.
author_sort Uspenskij Mikhail B.
collection DOAJ
description Modern data storage systems have a sophisticated hardware and software architecture, including multiple storage processors, storage fabrics, network equipment and storage media and contain information, which can be damaged or lost because of hardware or software fault. Approach to storage software diagnostics, presented in current paper, combines a log mining algorithms for fault detection based on natural language processing text classification methods, and usage of the diagnostic model for a task of fault source detection. Currently existing approaches to computational systems diagnostics are either ignoring system or event log data, using only numeric monitoring parameters, or target only certain log types or use logs to create chains of the structured events. The main advantage of using natural language processing method for log text classification is that no information of log message structure or log message source, or log purpose is required if there is enough data for classificator model training. Developed diagnostic procedure has accuracy score comparable with existing methods and can target all presented in training set faults without prior log structure research.
first_indexed 2024-12-22T20:31:35Z
format Article
id doaj.art-b1f16209c7b442e5ab01491dee2f2949
institution Directory Open Access Journal
issn 2267-1242
language English
last_indexed 2024-12-22T20:31:35Z
publishDate 2019-01-01
publisher EDP Sciences
record_format Article
series E3S Web of Conferences
spelling doaj.art-b1f16209c7b442e5ab01491dee2f29492022-12-21T18:13:35ZengEDP SciencesE3S Web of Conferences2267-12422019-01-011400300610.1051/e3sconf/201914003006e3sconf_eece18_03006Log mining and knowledge-based models in data storage systems diagnosticsUspenskij Mikhail B.0Peter the Great St. Petersburg Polytechnic UniversityModern data storage systems have a sophisticated hardware and software architecture, including multiple storage processors, storage fabrics, network equipment and storage media and contain information, which can be damaged or lost because of hardware or software fault. Approach to storage software diagnostics, presented in current paper, combines a log mining algorithms for fault detection based on natural language processing text classification methods, and usage of the diagnostic model for a task of fault source detection. Currently existing approaches to computational systems diagnostics are either ignoring system or event log data, using only numeric monitoring parameters, or target only certain log types or use logs to create chains of the structured events. The main advantage of using natural language processing method for log text classification is that no information of log message structure or log message source, or log purpose is required if there is enough data for classificator model training. Developed diagnostic procedure has accuracy score comparable with existing methods and can target all presented in training set faults without prior log structure research.https://www.e3s-conferences.org/articles/e3sconf/pdf/2019/66/e3sconf_eece18_03006.pdf
spellingShingle Uspenskij Mikhail B.
Log mining and knowledge-based models in data storage systems diagnostics
E3S Web of Conferences
title Log mining and knowledge-based models in data storage systems diagnostics
title_full Log mining and knowledge-based models in data storage systems diagnostics
title_fullStr Log mining and knowledge-based models in data storage systems diagnostics
title_full_unstemmed Log mining and knowledge-based models in data storage systems diagnostics
title_short Log mining and knowledge-based models in data storage systems diagnostics
title_sort log mining and knowledge based models in data storage systems diagnostics
url https://www.e3s-conferences.org/articles/e3sconf/pdf/2019/66/e3sconf_eece18_03006.pdf
work_keys_str_mv AT uspenskijmikhailb logminingandknowledgebasedmodelsindatastoragesystemsdiagnostics