Scalable Communication-Induced Checkpointing Protocol with Little Overhead for Distributed Computing Environments
The existing communication-induced checkpointing protocols may not scale well due to their slow acquisition of the most recent timestamps of the next checkpoints of other processes. Accurate situation awareness with diversified information conveyance paths is needed to reduce the number of unnecessa...
Main Author: | Jinho Ahn |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2023-06-01 |
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/12/12/2702 |
Similar Items
- Communication-Induced Checkpointing with Message Logging beyond the Piecewise Deterministic (PWD) Model for Distributed Systems
  by: Jinho Ahn
  Published: (2021-06-01)
- Fault Tolerant Distributed Stream Processing based on Backtracking
  by: Qiming Chen, et al.
  Published: (2013-11-01)
- A Checkpoint/Restart Scheme for CUDA Programs with Complex Computation States
  by: Hai Jiang, et al.
  Published: (2013-11-01)
- A Scalable Byzantine Fault Tolerance Algorithm Based on a Tree Topology Network
  by: Wangxi Jiang, et al.
  Published: (2023-01-01)
- Functional Safety Support for the Specialized Computers with Combined Fault-Tolerance Based on the Checkpoints Technique
  by: A. E. Alexandrovich, et al.
  Published: (2012-03-01)