Hadoop MapReduce Multi-Job Workloads using Resource Aware scheduler

Narayanan Shivakumar, Anirban Basu Rashmi

doi:10.26483/ijarcs.v5i6.2238

Abstract

Cloud computing offers a flexible infrastructure for large-scale data processing. MapReduce is a typical programming model that provides a logical framework for such processing, and Hadoop, an open-source implementation of MapReduce, is a common platform for realizing this kind of parallel computing model. We present a resource-aware scheduling technique for MapReduce multi-job workloads that aims at improving resource utilization across machines while observing completion time goals. Existing MapReduce schedulers represent the capacity of a cluster with a static number of execution slots per machine. This abstraction works for homogeneous workloads but fails to capture the different resource requirements of individual jobs in multi-user environments. Our technique leverages job profiling information to dynamically adjust the number of slots on each machine, as well as workload placement across them, to maximize the resource utilization of the cluster.

Key Words: MapReduce, scheduling, resource awareness, performance management, large-scale data processing, Hadoop.
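The contrast between a static slot count and a profile-driven one can be illustrated with a minimal sketch. This is not the authors' scheduler: the names `Machine`, `Profile`, `static_slots`, and `dynamic_slots`, the two resource dimensions (CPU cores and memory), and the default of four slots are all assumptions made for illustration. The sketch only shows how a per-task resource profile could determine how many concurrent tasks a machine can host without oversubscribing any resource.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Machine:
    """Hypothetical machine capacity."""
    cpu: int      # available cores
    mem_gb: int   # available memory in GB


@dataclass(frozen=True)
class Profile:
    """Hypothetical per-task demand, as job profiling might report it."""
    cpu: int      # cores needed by one task
    mem_gb: int   # memory in GB needed by one task


def static_slots(machine: Machine, slots_per_machine: int = 4) -> int:
    # Static abstraction: the same slot count regardless of the job.
    return slots_per_machine


def dynamic_slots(machine: Machine, profile: Profile) -> int:
    # Resource-aware abstraction (illustrative): the slot count is limited
    # by whichever resource runs out first for this job's tasks.
    return min(machine.cpu // profile.cpu, machine.mem_gb // profile.mem_gb)


node = Machine(cpu=8, mem_gb=16)
light = Profile(cpu=1, mem_gb=1)   # small tasks
heavy = Profile(cpu=1, mem_gb=8)   # memory-hungry tasks

print(static_slots(node), dynamic_slots(node, light), dynamic_slots(node, heavy))
```

Under these assumed numbers, a fixed four slots would leave half of the node idle for the light job and would oversubscribe memory for the heavy job (4 tasks at 8 GB each on a 16 GB machine), which is the mismatch the abstract attributes to static slot configurations.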

More From: International Journal of Advanced Research in Computer Science
