Governance and sustainability of distributed continuum systems: a big data approach

Abstract Distributed computing continuum systems (DCCS) make use of a vast number of computing devices to process data generated by edge devices such as the Internet of Things and sensor nodes. Besides performing computations, these devices also produce data including, for example, event logs, confi...

Full description

Bibliographic Details
Main Authors: Praveen Kumar Donta, Boris Sedlak, Victor Casamayor Pujol, Schahram Dustdar
Format: Article
Language:English
Published: SpringerOpen 2023-04-01
Series:Journal of Big Data
Subjects:
Online Access:https://doi.org/10.1186/s40537-023-00737-0
_version_ 1797836427778588672
author Praveen Kumar Donta
Boris Sedlak
Victor Casamayor Pujol
Schahram Dustdar
author_facet Praveen Kumar Donta
Boris Sedlak
Victor Casamayor Pujol
Schahram Dustdar
author_sort Praveen Kumar Donta
collection DOAJ
description Abstract Distributed computing continuum systems (DCCS) make use of a vast number of computing devices to process data generated by edge devices such as the Internet of Things and sensor nodes. Besides performing computations, these devices also produce data including, for example, event logs, configuration files, network management information. When these data are analyzed, we can learn more about the devices, such as their capabilities, processing efficiency, resource usage, and failure prediction. However, these data are available in different forms and have different attributes due to the highly heterogeneous nature of DCCS. The diversity of data poses various challenges which we discuss by relating them to big data, so that we can utilize the advantages of big data analytical tools. We enumerate several existing tools that can perform the monitoring task and also summarize their characteristics. Further, we provide a general governance and sustainable architecture for DCCS, which reflects the human body’s self-healing model. The proposed model has three stages: first, it analyzes system data to acquire knowledge; second, it can leverage the knowledge to monitor and predict future conditions; and third, it takes further actions to autonomously solve any issue or to alert administrators. Thus, the DCCS model is designed to minimize the system’s downtime while optimizing resource usage. A small set of data is used to illustrate the monitoring and prediction of the performance of a system through Bayesian network structure learning. Finally, we discuss the limitations of the governance and sustainability model, and we provide possible solutions to overcome them and make the system more efficient.
first_indexed 2024-04-09T15:08:46Z
format Article
id doaj.art-9dd325ba2bb64904a9f7d6c4f4f52bd3
institution Directory Open Access Journal
issn 2196-1115
language English
last_indexed 2024-04-09T15:08:46Z
publishDate 2023-04-01
publisher SpringerOpen
record_format Article
series Journal of Big Data
spelling doaj.art-9dd325ba2bb64904a9f7d6c4f4f52bd32023-04-30T11:18:44ZengSpringerOpenJournal of Big Data2196-11152023-04-0110113110.1186/s40537-023-00737-0Governance and sustainability of distributed continuum systems: a big data approachPraveen Kumar Donta0Boris Sedlak1Victor Casamayor Pujol2Schahram Dustdar3Distributed Systems Group, TU WienDistributed Systems Group, TU WienDistributed Systems Group, TU WienDistributed Systems Group, TU WienAbstract Distributed computing continuum systems (DCCS) make use of a vast number of computing devices to process data generated by edge devices such as the Internet of Things and sensor nodes. Besides performing computations, these devices also produce data including, for example, event logs, configuration files, network management information. When these data are analyzed, we can learn more about the devices, such as their capabilities, processing efficiency, resource usage, and failure prediction. However, these data are available in different forms and have different attributes due to the highly heterogeneous nature of DCCS. The diversity of data poses various challenges which we discuss by relating them to big data, so that we can utilize the advantages of big data analytical tools. We enumerate several existing tools that can perform the monitoring task and also summarize their characteristics. Further, we provide a general governance and sustainable architecture for DCCS, which reflects the human body’s self-healing model. The proposed model has three stages: first, it analyzes system data to acquire knowledge; second, it can leverage the knowledge to monitor and predict future conditions; and third, it takes further actions to autonomously solve any issue or to alert administrators. Thus, the DCCS model is designed to minimize the system’s downtime while optimizing resource usage. A small set of data is used to illustrate the monitoring and prediction of the performance of a system through Bayesian network structure learning. Finally, we discuss the limitations of the governance and sustainability model, and we provide possible solutions to overcome them and make the system more efficient.https://doi.org/10.1186/s40537-023-00737-0Distributed computing continuum systemsBig data analyticsGovernance and sustainabilitySelf-healingRepresentation learningBayesian network structure learning
spellingShingle Praveen Kumar Donta
Boris Sedlak
Victor Casamayor Pujol
Schahram Dustdar
Governance and sustainability of distributed continuum systems: a big data approach
Journal of Big Data
Distributed computing continuum systems
Big data analytics
Governance and sustainability
Self-healing
Representation learning
Bayesian network structure learning
title Governance and sustainability of distributed continuum systems: a big data approach
title_full Governance and sustainability of distributed continuum systems: a big data approach
title_fullStr Governance and sustainability of distributed continuum systems: a big data approach
title_full_unstemmed Governance and sustainability of distributed continuum systems: a big data approach
title_short Governance and sustainability of distributed continuum systems: a big data approach
title_sort governance and sustainability of distributed continuum systems a big data approach
topic Distributed computing continuum systems
Big data analytics
Governance and sustainability
Self-healing
Representation learning
Bayesian network structure learning
url https://doi.org/10.1186/s40537-023-00737-0
work_keys_str_mv AT praveenkumardonta governanceandsustainabilityofdistributedcontinuumsystemsabigdataapproach
AT borissedlak governanceandsustainabilityofdistributedcontinuumsystemsabigdataapproach
AT victorcasamayorpujol governanceandsustainabilityofdistributedcontinuumsystemsabigdataapproach
AT schahramdustdar governanceandsustainabilityofdistributedcontinuumsystemsabigdataapproach