This is a copy of the author's final draft version of an article published in the journal Cluster Computing.

Abstract

Business solutions today face a real challenge in the context of the large amounts of data generated by complex software applications: efficiently using limited resources to accomplish specific operations and tasks. Depending on the type of application involved, when a certain service must be delivered within a specific time and on a limited budget, a sequential application may be redesigned so that it becomes scalable and able to run on multiple resources. In this context, the Many Task Computing (MTC) model brings together loosely coupled applications composed of many dependent or independent tasks that work together toward a common result. When requesting a service, the constraints users most frequently specify are deadline and budget. However, depending on the nature of the tasks used in MTC, other constraints may also arise: tasks may be data intensive or computing intensive, independent or dependent, uni-processor or multi-processor. We therefore propose a multi-objective scheduling algorithm for many tasks in Hadoop for Big Data processing, named MOMTH. The algorithm was evaluated in the Scheduling Load Simulator, which is integrated in Hadoop and easy to use. We compared the proposed algorithm with the FIFO and Fair schedulers and obtained similar performance for our approach.