A time–energy performance analysis of MapReduce on heterogeneous systems with GPUs

Dumitrel Loghin,Lavanya Ramapantulu,Oana Barbu,Yong Meng Teo

doi:10.1016/j.peva.2015.06.015

Abstract

Motivated by the explosion of Big Data analytics, performance improvements in low-power (wimpy) systems and the increasing energy efficiency of GPUs, this paper presents a time–energy performance analysis of MapReduce on heterogeneous systems with GPUs. We evaluate the time and energy performance of three MapReduce applications with diverse resource demands on a Hadoop–CUDA framework. As executing these applications on heterogeneous systems with GPUs is challenging, we introduce a novel lazy processing technique which requires no modifications to the underlying Hadoop framework. To analyze the impact of heterogeneity, we compare the heterogeneous CPU+GPU with the homogeneous CPU-only execution across three systems with diverse characteristics, (i) a traditional high-performance (brawny) Intel i7 system hosting a discrete 640-core Nvidia GPU of the latest Maxwell generation, (ii) a wimpy platform consisting of a quad-core ARM Cortex-A9 hosting the same discrete Maxwell GPU, and (iii) a wimpy platform integrating four ARM Cortex-A15 cores and 192 Nvidia Kepler GPU cores on the same chip. These systems encompass both intra-node heterogeneity with discrete GPUs and intra-chip heterogeneity with integrated GPUs. Our measurement-based performance analysis highlights the following results. For compute-intensive workloads, the brawny heterogeneous system achieves speedups of up to 2.3 and reduces the energy usage by almost half compared to the brawny homogeneous system. As expected, for applications where data transfers dominate the execution time, heterogeneity exhibits worse time–energy performance compared to homogeneous systems. For such applications, the heterogeneous wimpy A9 system with discrete GPU uses around 14 times the energy of homogeneous A9 system due to both system resource imbalances and high power overhead of the discrete GPU. However, comparing among heterogeneous systems, the wimpy A15 with integrated GPU uses the lowest energy across all workloads. This allows us to establish an execution time equivalence ratio between a single brawny node and multiple wimpy nodes. Based on this equivalence ratio, the wimpy nodes exhibit energy savings of two-thirds while maintaining the same execution time. This result advocates the potential usage of heterogeneous wimpy systems with integrated GPUs for Big Data analytics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A time–energy performance analysis of MapReduce on heterogeneous systems with GPUs

Abstract

Talk to us

Similar Papers

More From: Performance Evaluation

Lead the way for us

Journal: Performance Evaluation	Publication Date: Jul 6, 2015
Citations: 19

Similar Papers

Parallelisation of equation-based simulation programs on heterogeneous computing systems.
Dragan D Nikolić
PeerJ Computer Science | VOL. 4
Dragan D NikolićDragan D Nikolić
13 Aug 2018
PeerJ Computer Science | VOL. 4

HeTM: Transactional Memory for Heterogeneous Systems
Daniel Castro ... Aleksandar Ilic
-
Daniel Castro, et. al.Daniel Castro ... Aleksandar Ilic
01 Sep 2019
01 Sep 2019

Size Distributions of Mesophase Microbeads Obtained from Heterogeneous and Homogeneous Systems
Wen Wen Zhang ... Zhen Fan
Key Engineering Materials | VOL. 609-610
Wen Wen Zhang, et. al.Wen Wen Zhang ... Zhen Fan
01 Apr 2014
Key Engineering Materials | VOL. 609-610

Origin of rate dispersion in translational diffusion: Distinguishing heterogeneous from homogeneous using 2D correlation analysis
Ruchir Gupta ... Sachin Dev Verma
Chemical Physics Impact | VOL. 7
Ruchir Gupta, et. al.Ruchir Gupta ... Sachin Dev Verma
28 Sep 2023
Chemical Physics Impact | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A time–energy performance analysis of MapReduce on heterogeneous systems with GPUs

Abstract

Talk to us

Similar Papers

More From: Performance Evaluation