MapReduce scheduling algorithms: a review

Recent trends in big data have shown that the amount of data continues to increase at an exponential rate. This trend has inspired many researchers over the past few years to explore new research direction of studies related to multiple areas of big data. The widespread popularity of big data proces...

Full description

Bibliographic Details
Main Authors: Hashem, Ibrahim Abaker Targio, Nor Badrul, Anuar, Marjani, Mohsen, Ahmed, Ejaz, Chiroma, Haruna, Ahmad Firdaus, Zainal Abidin, Muhamad Taufik, Abdullah, Faiz, Alotaibi, Mahmoud Ali, Waleed Kamaleldin, Yaqoob, Ibrar, Abdullah, Gani
Format: Article
Language:English
English
Published: Springer 2020
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/30281/1/MapReduce%20scheduling%20algorithms-%20a%20review.pdf
http://umpir.ump.edu.my/id/eprint/30281/2/MapReduce%20scheduling%20algorithms-a%20review_FULL.pdf
Description
Summary:Recent trends in big data have shown that the amount of data continues to increase at an exponential rate. This trend has inspired many researchers over the past few years to explore new research direction of studies related to multiple areas of big data. The widespread popularity of big data processing platforms using MapReduce framework is the growing demand to further optimize their performance for various purposes. In particular, enhancing resources and jobs scheduling are becoming critical since they fundamentally determine whether the applications can achieve the performance goals in different use cases. Scheduling plays an important role in big data, mainly in reducing the execution time and cost of processing. This paper aims to survey the research undertaken in the field of scheduling in big data platforms. Moreover, this paper analyzed scheduling in MapReduce on two aspects: taxonomy and performance evaluation. The research progress in MapReduce scheduling algorithms is also discussed. The limitations of existing MapReduce scheduling algorithms and exploit future research opportunities are pointed out in the paper for easy identification by researchers. Our study can serve as the benchmark to expert researchers for proposing a novel MapReduce scheduling algorithm. However, for novice researchers, the study can be used as a starting point.