Heat Prediction of High Energy Physical Data Based on LSTM Recurrent Neural Network

High-energy physics computing is a typical data-intensive calculation. Each year, petabytes of data needs to be analyzed, and data access performance is increasingly demanding. The tiered storage system scheme for building a unified namespace has been widely adopted. Generally, data is stored on sto...

Full description

Bibliographic Details
Main Authors: Cheng Zhenjing, Wang Lu, Cheng Yaodong, Chen Gang
Format: Article
Language:English
Published: EDP Sciences 2020-01-01
Series:EPJ Web of Conferences
Online Access:https://www.epj-conferences.org/articles/epjconf/pdf/2020/21/epjconf_chep2020_04002.pdf
_version_ 1818643610457866240
author Cheng Zhenjing
Wang Lu
Cheng Yaodong
Chen Gang
author_facet Cheng Zhenjing
Wang Lu
Cheng Yaodong
Chen Gang
author_sort Cheng Zhenjing
collection DOAJ
description High-energy physics computing is a typical data-intensive calculation. Each year, petabytes of data needs to be analyzed, and data access performance is increasingly demanding. The tiered storage system scheme for building a unified namespace has been widely adopted. Generally, data is stored on storage devices with different performances and different prices according to different access frequency. When the heat of the data changes, the data is then migrated to the appropriate storage tier. At present, heuristic algorithms based on artificial experience are widely used in data heat prediction. Due to the differences in computing models of different users, the accuracy of prediction is low. A method for predicting future access popularity based on file access characteristics with the help of LSTM deep learning algorithm is proposed as the basis for data migration in hierarchical storage. This paper uses the real data of high-energy physics experiment LHAASO as an example for comparative testing. The results show that under the same test conditions, the model has higher prediction accuracy and stronger applicability than existing prediction models.
first_indexed 2024-12-17T00:01:41Z
format Article
id doaj.art-26cebac431194ed692d5d8066d38f720
institution Directory Open Access Journal
issn 2100-014X
language English
last_indexed 2024-12-17T00:01:41Z
publishDate 2020-01-01
publisher EDP Sciences
record_format Article
series EPJ Web of Conferences
spelling doaj.art-26cebac431194ed692d5d8066d38f7202022-12-21T22:11:04ZengEDP SciencesEPJ Web of Conferences2100-014X2020-01-012450400210.1051/epjconf/202024504002epjconf_chep2020_04002Heat Prediction of High Energy Physical Data Based on LSTM Recurrent Neural NetworkCheng ZhenjingWang LuCheng YaodongChen Gang0IHEP computing centerHigh-energy physics computing is a typical data-intensive calculation. Each year, petabytes of data needs to be analyzed, and data access performance is increasingly demanding. The tiered storage system scheme for building a unified namespace has been widely adopted. Generally, data is stored on storage devices with different performances and different prices according to different access frequency. When the heat of the data changes, the data is then migrated to the appropriate storage tier. At present, heuristic algorithms based on artificial experience are widely used in data heat prediction. Due to the differences in computing models of different users, the accuracy of prediction is low. A method for predicting future access popularity based on file access characteristics with the help of LSTM deep learning algorithm is proposed as the basis for data migration in hierarchical storage. This paper uses the real data of high-energy physics experiment LHAASO as an example for comparative testing. The results show that under the same test conditions, the model has higher prediction accuracy and stronger applicability than existing prediction models.https://www.epj-conferences.org/articles/epjconf/pdf/2020/21/epjconf_chep2020_04002.pdf
spellingShingle Cheng Zhenjing
Wang Lu
Cheng Yaodong
Chen Gang
Heat Prediction of High Energy Physical Data Based on LSTM Recurrent Neural Network
EPJ Web of Conferences
title Heat Prediction of High Energy Physical Data Based on LSTM Recurrent Neural Network
title_full Heat Prediction of High Energy Physical Data Based on LSTM Recurrent Neural Network
title_fullStr Heat Prediction of High Energy Physical Data Based on LSTM Recurrent Neural Network
title_full_unstemmed Heat Prediction of High Energy Physical Data Based on LSTM Recurrent Neural Network
title_short Heat Prediction of High Energy Physical Data Based on LSTM Recurrent Neural Network
title_sort heat prediction of high energy physical data based on lstm recurrent neural network
url https://www.epj-conferences.org/articles/epjconf/pdf/2020/21/epjconf_chep2020_04002.pdf
work_keys_str_mv AT chengzhenjing heatpredictionofhighenergyphysicaldatabasedonlstmrecurrentneuralnetwork
AT wanglu heatpredictionofhighenergyphysicaldatabasedonlstmrecurrentneuralnetwork
AT chengyaodong heatpredictionofhighenergyphysicaldatabasedonlstmrecurrentneuralnetwork
AT chengang heatpredictionofhighenergyphysicaldatabasedonlstmrecurrentneuralnetwork