Communication-Induced Checkpointing with Message Logging beyond the Piecewise Deterministic (PWD) Model for Distributed Systems

Bibliographic Details
Main Author: Jinho Ahn
Format: Article
Language: English
Published: MDPI AG 2021-06-01
Series: Electronics
Subjects: distributed systems; fault tolerance; checkpointing; message logging
Online Access: https://www.mdpi.com/2079-9292/10/12/1428
author Jinho Ahn
collection DOAJ
description This paper introduces an effective communication-induced checkpointing protocol that uses message logging to keep the number of extra checkpoints far lower than in previous protocols. Even when it is decided that a process receiving a message has to perform forced checkpointing, the protocol allows the process to skip the forced checkpoint if it recognizes that the state of its sender right before the receipt of the message is recoverable. As a result, the protocol is not required to assume the piecewise deterministic model, despite being combined with message logging. It maintains these features by piggybacking a one-bit variable and an n-size vector on each message sent. Simulation results support the claim that the presented protocol performs much better than the representative optimized protocol with respect to forced checkpointing frequency, regardless of the communication pattern.
first_indexed 2024-03-10T10:25:02Z
format Article
id doaj.art-0a6e3a7e8ffc4dcca09f73ab43ff54b8
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-10T10:25:02Z
publishDate 2021-06-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-0a6e3a7e8ffc4dcca09f73ab43ff54b8 | 2023-11-22T00:02:19Z | eng | MDPI AG | Electronics | ISSN 2079-9292 | 2021-06-01 | vol. 10, no. 12, art. 1428 | DOI 10.3390/electronics10121428 | Communication-Induced Checkpointing with Message Logging beyond the Piecewise Deterministic (PWD) Model for Distributed Systems | Jinho Ahn (Division of AI Computer Science and Engineering, Kyonggi University, Suwon 16227, Gyeonggi, Korea) | https://www.mdpi.com/2079-9292/10/12/1428 | distributed systems; fault tolerance; checkpointing; message logging
title Communication-Induced Checkpointing with Message Logging beyond the Piecewise Deterministic (PWD) Model for Distributed Systems
topic distributed systems
fault tolerance
checkpointing
message logging
url https://www.mdpi.com/2079-9292/10/12/1428