An Analytical Approach for Optimizing the Performance of Hadoop Map Reduce Over RoCE

Geetha J Geetha J,Chenna Reddy P Chenna Reddy P,Uday Bhaskar N Uday Bhaskar N

doi:10.4018/ijicthd.2018040101

Abstract

Data intensive systems aim to efficiently process “big” data. Several data processing engines have evolved over past decade. These data processing engines are modeled around the MapReduce paradigm. This article explores Hadoop's MapReduce engine and propose techniques to obtain a higher level of optimization by borrowing concepts from the world of High Performance Computing. Consequently, power consumed and heat generated is lowered. This article designs a system with a pipelined dataflow in contrast to the existing unregulated “bursty” flow of network traffic, the ability to carry out both Map and Reduce tasks in parallel, and a system which incorporates modern high-performance computing concepts using Remote Direct Memory Access (RDMA). To establish the claim of an increased performance measure of the proposed system, the authors provide an algorithm for RoCE enabled MapReduce and a mathematical derivation contrasting the runtime of vanilla Hadoop. This article proves mathematically, that the proposed system functions 1.67 times faster than the vanilla version of Hadoop.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Analytical Approach for Optimizing the Performance of Hadoop Map Reduce Over RoCE

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Communication Technologies and Human Development

Lead the way for us

Journal: International Journal of Information Communication Technologies and Human Development	Publication Date: Apr 1, 2018
Citations: 4

Similar Papers

Scalable connectionless RDMA over unreliable datagrams
Ryan E Grant ... Ahmad Afsahi
Parallel Computing | VOL. 48
Ryan E Grant, et. al.Ryan E Grant ... Ahmad Afsahi
11 Apr 2015
Parallel Computing | VOL. 48

RVMA: Remote Virtual Memory Access
Ryan E Grant ... Matthew G.F Dosanjh
-
Ryan E Grant, et. al.Ryan E Grant ... Matthew G.F Dosanjh
01 May 2021
01 May 2021

A Performance Study to Guide RDMA Programming Decisions
Patrick Macarthur ... Robert D Russell
-
Patrick Macarthur, et. al.Patrick Macarthur ... Robert D Russell
01 Jun 2012
01 Jun 2012

Evaluation of RDMA Opportunities in an Object-Oriented DSM
Ronald Veldema ... Michael Philippsen
-
Ronald Veldema, et. al.Ronald Veldema ... Michael Philippsen
01 Oct 2007
01 Oct 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Analytical Approach for Optimizing the Performance of Hadoop Map Reduce Over RoCE

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Communication Technologies and Human Development