A survey on potential reactive fault tolerance approach for distributed systems in big data

Due to their unique properties such as high availability and reliability, distributed systems are gaining popularity nowadays. However, the rapid growth of Big Data in distributed systems creates new issues for dataset reliability and availability. In any distributed computer system, the presence an...

Full description

Bibliographic Details
Main Authors: Noraziah, Ahmad, Sharifah Hafizah, Syed Ahmad Ubaidillah, Noor Azida, Sahabudin
Format: Conference or Workshop Item
Language:English
English
Published: SPIE Digital Library 2023
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/35156/1/SPIE%20-%20ID-AU1001.pdf
http://umpir.ump.edu.my/id/eprint/35156/7/A%20survey%20on%20potential%20reactive%20fault%20tolerance%20approach.pdf
_version_ 1825814536092909568
author Noraziah, Ahmad
Sharifah Hafizah, Syed Ahmad Ubaidillah
Noor Azida, Sahabudin
author_facet Noraziah, Ahmad
Sharifah Hafizah, Syed Ahmad Ubaidillah
Noor Azida, Sahabudin
author_sort Noraziah, Ahmad
collection UMP
description Due to their unique properties such as high availability and reliability, distributed systems are gaining popularity nowadays. However, the rapid growth of Big Data in distributed systems creates new issues for dataset reliability and availability. In any distributed computer system, the presence and recurrence of failures is an inescapable factor. Both hardware and software components of distributed systems are prone to failure. As a result, the issue of fault tolerance is being recognized as the fundamental theme and essential requirement for the construction and maintenance of the distributed computing paradigm in order to achieve prominence and criticality. Fault tolerance refers to the application that must be executed even in failure conditions by detecting and correcting the fault. Reactive fault tolerance techniques are used to effectively troubleshoot the systems upon occurrences of failures. This paper aims to provide a better understanding of reactive fault tolerance techniques and identifies various approaches used as reactive fault tolerance in distributed systems. Based on the reviews done in this research, there are various reactive fault tolerance techniques that can improve the performance of the distributed systems in terms of availability, reliability, total execution time, and communication cost such as replication, checkpointing task resubmission, and job migration.
first_indexed 2024-03-06T13:00:04Z
format Conference or Workshop Item
id UMPir35156
institution Universiti Malaysia Pahang
language English
English
last_indexed 2024-03-06T13:00:04Z
publishDate 2023
publisher SPIE Digital Library
record_format dspace
spelling UMPir351562023-08-24T03:27:32Z http://umpir.ump.edu.my/id/eprint/35156/ A survey on potential reactive fault tolerance approach for distributed systems in big data Noraziah, Ahmad Sharifah Hafizah, Syed Ahmad Ubaidillah Noor Azida, Sahabudin QA76 Computer software Due to their unique properties such as high availability and reliability, distributed systems are gaining popularity nowadays. However, the rapid growth of Big Data in distributed systems creates new issues for dataset reliability and availability. In any distributed computer system, the presence and recurrence of failures is an inescapable factor. Both hardware and software components of distributed systems are prone to failure. As a result, the issue of fault tolerance is being recognized as the fundamental theme and essential requirement for the construction and maintenance of the distributed computing paradigm in order to achieve prominence and criticality. Fault tolerance refers to the application that must be executed even in failure conditions by detecting and correcting the fault. Reactive fault tolerance techniques are used to effectively troubleshoot the systems upon occurrences of failures. This paper aims to provide a better understanding of reactive fault tolerance techniques and identifies various approaches used as reactive fault tolerance in distributed systems. Based on the reviews done in this research, there are various reactive fault tolerance techniques that can improve the performance of the distributed systems in terms of availability, reliability, total execution time, and communication cost such as replication, checkpointing task resubmission, and job migration. SPIE Digital Library 2023-02 Conference or Workshop Item PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/35156/1/SPIE%20-%20ID-AU1001.pdf pdf en http://umpir.ump.edu.my/id/eprint/35156/7/A%20survey%20on%20potential%20reactive%20fault%20tolerance%20approach.pdf Noraziah, Ahmad and Sharifah Hafizah, Syed Ahmad Ubaidillah and Noor Azida, Sahabudin (2023) A survey on potential reactive fault tolerance approach for distributed systems in big data. In: 2022 3rd International Conference on Computer Vision and Information Technology (CVIT 2022) , 19 - 21 August 2022 , Beijing, China. pp. 1-8., 12590 (1259009). (Published) https://doi.org/10.1117/12.2670017
spellingShingle QA76 Computer software
Noraziah, Ahmad
Sharifah Hafizah, Syed Ahmad Ubaidillah
Noor Azida, Sahabudin
A survey on potential reactive fault tolerance approach for distributed systems in big data
title A survey on potential reactive fault tolerance approach for distributed systems in big data
title_full A survey on potential reactive fault tolerance approach for distributed systems in big data
title_fullStr A survey on potential reactive fault tolerance approach for distributed systems in big data
title_full_unstemmed A survey on potential reactive fault tolerance approach for distributed systems in big data
title_short A survey on potential reactive fault tolerance approach for distributed systems in big data
title_sort survey on potential reactive fault tolerance approach for distributed systems in big data
topic QA76 Computer software
url http://umpir.ump.edu.my/id/eprint/35156/1/SPIE%20-%20ID-AU1001.pdf
http://umpir.ump.edu.my/id/eprint/35156/7/A%20survey%20on%20potential%20reactive%20fault%20tolerance%20approach.pdf
work_keys_str_mv AT noraziahahmad asurveyonpotentialreactivefaulttoleranceapproachfordistributedsystemsinbigdata
AT sharifahhafizahsyedahmadubaidillah asurveyonpotentialreactivefaulttoleranceapproachfordistributedsystemsinbigdata
AT noorazidasahabudin asurveyonpotentialreactivefaulttoleranceapproachfordistributedsystemsinbigdata
AT noraziahahmad surveyonpotentialreactivefaulttoleranceapproachfordistributedsystemsinbigdata
AT sharifahhafizahsyedahmadubaidillah surveyonpotentialreactivefaulttoleranceapproachfordistributedsystemsinbigdata
AT noorazidasahabudin surveyonpotentialreactivefaulttoleranceapproachfordistributedsystemsinbigdata