Distributed recovery block scheme-based fault tolerant message passing system

This thesis presents a fault-tolerant message passing system incorporating a variation of the distributed recovery block approach. Inter processor communication is one of the key activities of parallel and distributed computer systems. Message passing in large interconnection networks is a critical...

Full description

Bibliographic Details
Main Author: Gu, Wei.
Other Authors: Khan, Gul Nawaz
Format: Thesis
Published: 2008
Subjects:
Online Access:http://hdl.handle.net/10356/2646
_version_ 1826113935483338752
author Gu, Wei.
author2 Khan, Gul Nawaz
author_facet Khan, Gul Nawaz
Gu, Wei.
author_sort Gu, Wei.
collection NTU
description This thesis presents a fault-tolerant message passing system incorporating a variation of the distributed recovery block approach. Inter processor communication is one of the key activities of parallel and distributed computer systems. Message passing in large interconnection networks is a critical part of high performance computing and it has attracted a great deal of attention in the recent years. In many applications, the requirements for efficient inter processor communication and system reliability are increasing. However, in most of the general-purpose parallel and distributed systems, little attention is given to this potential problem. The aim of this research is to develop a fault-tolerant and adaptive message passing system that assures a successful delivery of the messages even under faulty conditions. This thesis presents an investigation of fault-tolerant routing algorithms for unicast, multicast and broadcast that deliver messages as long as a healthy path exists between the source and destination nodes.
first_indexed 2024-10-01T03:31:19Z
format Thesis
id ntu-10356/2646
institution Nanyang Technological University
last_indexed 2024-10-01T03:31:19Z
publishDate 2008
record_format dspace
spelling ntu-10356/26462023-03-04T00:31:16Z Distributed recovery block scheme-based fault tolerant message passing system Gu, Wei. Khan, Gul Nawaz School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer-communication networks This thesis presents a fault-tolerant message passing system incorporating a variation of the distributed recovery block approach. Inter processor communication is one of the key activities of parallel and distributed computer systems. Message passing in large interconnection networks is a critical part of high performance computing and it has attracted a great deal of attention in the recent years. In many applications, the requirements for efficient inter processor communication and system reliability are increasing. However, in most of the general-purpose parallel and distributed systems, little attention is given to this potential problem. The aim of this research is to develop a fault-tolerant and adaptive message passing system that assures a successful delivery of the messages even under faulty conditions. This thesis presents an investigation of fault-tolerant routing algorithms for unicast, multicast and broadcast that deliver messages as long as a healthy path exists between the source and destination nodes. Master of Engineering (SAS) 2008-09-17T09:06:58Z 2008-09-17T09:06:58Z 2000 2000 Thesis http://hdl.handle.net/10356/2646 Nanyang Technological University application/pdf
spellingShingle DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer-communication networks
Gu, Wei.
Distributed recovery block scheme-based fault tolerant message passing system
title Distributed recovery block scheme-based fault tolerant message passing system
title_full Distributed recovery block scheme-based fault tolerant message passing system
title_fullStr Distributed recovery block scheme-based fault tolerant message passing system
title_full_unstemmed Distributed recovery block scheme-based fault tolerant message passing system
title_short Distributed recovery block scheme-based fault tolerant message passing system
title_sort distributed recovery block scheme based fault tolerant message passing system
topic DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer-communication networks
url http://hdl.handle.net/10356/2646
work_keys_str_mv AT guwei distributedrecoveryblockschemebasedfaulttolerantmessagepassingsystem