Dynamic Performance Aware Reduce Task Scheduling in MapReduce on Virtualized Environment

Rathinaraja Jeyaraj,V S Ananthanarayana

doi:10.1109/sera.2018.8477195

Abstract

Hadoop MapReduce as a service from cloud is widely used by various research, and commercial communities. Hadoop MapReduce is typically offered as a service hosted on virtualized environment in Cloud Data-Center. Cluster of virtual machines for MapReduce is placed across racks in Cloud Data-Center to achieve fault tolerance. But, it negatively introduces dynamic/heterogeneous performance for virtual machines due to hardware heterogeneity and co-located virtual machine's interference, which cause varying latency for same task. Alongside, curbing number of intermediate records and placing reduce tasks on right virtual node are also important to minimize MapReduce job latency further. In this paper, we introduce Multi-Level Per Node Combiner to minimize the number of intermediate records and Dynamic Ranking based MapReduce Job Scheduler to place reduce tasks on right virtual machine to minimize MapReduce job latency by exploiting dynamic performance of virtual machines. To experiment and evaluate, we launched 29 virtual machines hosted in eight different physical machines to run wordcount job on PUMA dataset. Our proposed methodology improves overall job latency up to 33% for wordcount job.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dynamic Performance Aware Reduce Task Scheduling in MapReduce on Virtualized Environment

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Dynamic ranking-based MapReduce job scheduler to exploit heterogeneous performance in a virtualized environment
J Rathinaraja ... Anand Paul
The Journal of Supercomputing | VOL. 75
J Rathinaraja, et. al.J Rathinaraja ... Anand Paul
01 Aug 2019
The Journal of Supercomputing | VOL. 75

Multi-level per node combiner (MLPNC) to minimize mapreduce job latency on virtualized environment
Rathinaraja Jeyaraj ... Ananthanarayana V S
-
Rathinaraja Jeyaraj, et. al.Rathinaraja Jeyaraj ... Ananthanarayana V S
09 Apr 2018
09 Apr 2018

Handling Non-Local Executions to Improve MapReduce Performance Using Ant Colony Optimization
Gurwinder Singh ... Anil Sharma
IEEE Access | VOL. 9
Gurwinder Singh, et. al.Gurwinder Singh ... Anil Sharma
01 Jan 2020
IEEE Access | VOL. 9

Integrating QoS awareness with virtualization in cloud computing systems for delay-sensitive applications
Jenn-Wei Lin ... Chi-Yi Lin
Future Generation Computer Systems | VOL. 37
Jenn-Wei Lin, et. al.Jenn-Wei Lin ... Chi-Yi Lin
09 Jan 2014
Future Generation Computer Systems | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic Performance Aware Reduce Task Scheduling in MapReduce on Virtualized Environment

Abstract

Talk to us

Similar Papers