Communication-Induced Checkpointing with Message Logging beyond the Piecewise Deterministic (PWD) Model for Distributed Systems
This paper introduces an effective communication-induced checkpointing protocol using message logging to enable the number of extra checkpoints to be far lower than the previous number. Even if a situation occurs in which it is decided that a process receiving a message has to perform forced checkpo...
Main Author: | Jinho Ahn |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2021-06-01 |
Series: | Electronics |
Subjects: | distributed systems; fault tolerance; checkpointing; message logging |
Online Access: | https://www.mdpi.com/2079-9292/10/12/1428 |
_version_ | 1797530160974528512 |
---|---|
author | Jinho Ahn |
collection | DOAJ |
description | This paper introduces an effective communication-induced checkpointing protocol using message logging to enable the number of extra checkpoints to be far lower than the previous number. Even if a situation occurs in which it is decided that a process receiving a message has to perform forced checkpointing, our protocol allows the process to skip the forced checkpointing action if it recognizes that the state of its sender right before the receipt of the message is recoverable. Additionally, the communication-induced checkpointing protocol is thus not required to assume the piecewise deterministic model, despite being combined with message logging. This protocol can maintain these features by piggybacking a one-bit variable and an n-size vector on each message sent. Our simulation results verify our claim that the presented protocol performs much better than the representative optimized protocol with respect to the forced checkpointing frequency, regardless of the communication pattern. |
first_indexed | 2024-03-10T10:25:02Z |
format | Article |
id | doaj.art-0a6e3a7e8ffc4dcca09f73ab43ff54b8 |
institution | Directory of Open Access Journals |
issn | 2079-9292 |
language | English |
last_indexed | 2024-03-10T10:25:02Z |
publishDate | 2021-06-01 |
publisher | MDPI AG |
record_format | Article |
series | Electronics |
spelling | doaj.art-0a6e3a7e8ffc4dcca09f73ab43ff54b8; 2023-11-22T00:02:19Z; eng; MDPI AG; Electronics; ISSN 2079-9292; 2021-06-01; vol. 10, no. 12, art. 1428; doi:10.3390/electronics10121428; Communication-Induced Checkpointing with Message Logging beyond the Piecewise Deterministic (PWD) Model for Distributed Systems; Jinho Ahn (Division of AI Computer Science and Engineering, Kyonggi University, Suwon 16227, Gyeonggi, Korea); https://www.mdpi.com/2079-9292/10/12/1428; distributed systems; fault tolerance; checkpointing; message logging |
title | Communication-Induced Checkpointing with Message Logging beyond the Piecewise Deterministic (PWD) Model for Distributed Systems |
topic | distributed systems; fault tolerance; checkpointing; message logging |
url | https://www.mdpi.com/2079-9292/10/12/1428 |