Self-Learning MapReduce Scheduler in Multi-job Environment

Changhang Lin, Wenzhong Guo, Changhui Lin

doi:10.1109/cloudcom-asia.2013.95

https://doi.org/10.1109/cloudcom-asia.2013.95

Abstract

Hadoop, the most widely adopted open-source implementation of the MapReduce framework, makes MapReduce widely accessible. However, it is currently limited by its default MapReduce scheduler. To achieve better performance, the scheduler should take into account nodes' computing power and system resources in a heterogeneous environment. Furthermore, from the job perspective, tasks' non-linear progress is also an important factor. Some research has been carried out to enhance the performance of MapReduce, but it does not satisfactorily consider the characteristics of both nodes and jobs. To overcome this drawback, we propose a Self-Learning MapReduce Scheduler (SLM), which outperforms the existing schedulers in a multi-job environment. Since competition for system resources may make a task's progress unpredictable, SLM determines the progress of each job based on that job's own historical information. In particular, in the self-learning stage of a job, SLM calculates the task phase weights using feedback from the first few tasks. With these phase weights, SLM can obtain more accurate execution time estimates, which are the most important condition for finding stragglers (slow tasks). Experimental results show that SLM effectively improves the accuracy of execution time estimation and straggler identification, leading to rational utilization of resources and shorter job execution times, especially in a multi-job environment.
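
The abstract does not give SLM's formulas, so the following is only a minimal sketch of the general idea it describes: learn per-job phase weights from the first few finished tasks, then use them to estimate remaining time and flag the slowest task. The phase names (`copy`, `sort`, `reduce`), the averaging rule, the linear remaining-time formula, and all function names are assumptions made for illustration, not the authors' method.

```python
from typing import Dict, List

# Hypothetical phase names for a reduce task (an assumption, not from the paper).
PHASES = ("copy", "sort", "reduce")


def learn_phase_weights(samples: List[Dict[str, float]]) -> Dict[str, float]:
    """Average each phase's share of total task time over the sampled tasks."""
    if not samples:
        raise ValueError("need at least one finished task to learn from")
    weights = {p: 0.0 for p in PHASES}
    for durations in samples:
        total = sum(durations[p] for p in PHASES)
        for p in PHASES:
            weights[p] += durations[p] / total
    return {p: w / len(samples) for p, w in weights.items()}


def progress_score(weights: Dict[str, float], phase: str, sub_progress: float) -> float:
    """Weights of completed phases plus the weighted progress of the current phase."""
    completed = sum(weights[p] for p in PHASES[:PHASES.index(phase)])
    return completed + weights[phase] * sub_progress


def estimate_remaining(elapsed: float, score: float) -> float:
    """Assume the task keeps its average rate: remaining = elapsed * (1 - s) / s."""
    if score <= 0.0:
        return float("inf")
    return elapsed * (1.0 - score) / score


def pick_straggler(remaining_by_task: Dict[str, float]) -> str:
    """Return the task id with the largest estimated remaining time."""
    return max(remaining_by_task, key=remaining_by_task.get)


# Feedback from the first two finished tasks of a job (made-up durations, in seconds).
samples = [
    {"copy": 10.0, "sort": 10.0, "reduce": 20.0},
    {"copy": 20.0, "sort": 10.0, "reduce": 10.0},
]
weights = learn_phase_weights(samples)
score = progress_score(weights, "sort", 0.5)   # halfway through the sort phase
remaining = estimate_remaining(30.0, score)    # 30 s elapsed so far
```

Because the weights are learned per job rather than fixed globally, two jobs with very different phase profiles get different progress scores for the same phase position, which is the property the abstract attributes to the self-learning stage.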

Similar Papers

SAMR: A Self-adaptive MapReduce Scheduling Algorithm in Heterogeneous Environment
Quan Chen ... Song Guo
01 Jun 2010

FiGMR: A fine-grained MapReduce scheduler in the heterogeneous cloud
Yingchi Mao ... Xiaofang Li
01 Aug 2016

A Fine-Grained and Dynamic MapReduce Task Scheduling Scheme for the Heterogeneous Cloud Environment
Yingchi Mao ... Haishi Zhong
01 Aug 2015

HAT: history-based auto-tuning MapReduce in heterogeneous environments
Quan Chen ... Long Zheng
The Journal of Supercomputing | VOL. 64
23 Sep 2011
