Cloud computing has emerged as a model that harnesses massive capacities of data centers to host services in a costeffective manner. MapReduce has been widely used as a Big Data processing platform, proposed by Google in 2004 and has become a popular parallel computing framework for large-scale data processing since then. It is best suited for embarrassingly parallel and data-intensive tasks. It is designed to read large amount of data stored in a distributed file system such as Google File System (GFS), process the data in parallel, aggregate and store the results back to the distributed file system. Scheduling is one of the most critical aspects of MapReduce. Also three important scheduling issues in MapReduce such as locality, synchronization and fairness exist. This paper tries to illustrate and analyze the overview of thirteen different aware scheduling algorithms with different techniques and approaches for MapReduce in Hadoop and their scheduling issues and problems. At the end, Advantages and disadvantages of these algorithms are identified.
A heterogeneous computing environment is a large-scale distributed data processing environment, it is depends to some extent parameters on the application and that classified in three main categories such as the hardware, the communication layer, and the software. A computer system is consists of hardware and software from two or more different manufacturers. Scheduling is one of the important factors in the heterogeneous environment and the aim of task scheduling in the processing environment is to move computation towards data. In order to achieve improve performance, increase the throughput and minimizing the makespan; scheduler must avoid unnecessary data transmission. Hence, different scheduling algorithms for heterogeneous computing environment are necessary to provide good performance. How to speedup scheduling the service resources to achieve the lowest cost becomes more and more important. This paper tries to illustrate and analyze the overview of eighteen different scheduling algorithms for heterogeneous computing environment and their scheduling issues and problems.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.