On the Use of GPU for Accelerating Communication-Aware Mapping Techniques

Guillermo Vigueras,Juan M Orduña

doi:10.1093/comjnl/bxv037

Abstract

Different communication-aware mapping techniques were proposed in recent years for improving the performance of distributed systems based on both, off-chip and on-chip networks. Some of these proposals were based on heuristic search for finding pseudo-optimal assignments of tasks and processing elements. However, the technology integration improvements have allowed a significant increase in the number of network nodes, requiring the acceleration of the heuristic search. In this paper, we propose a comparative study of the local search method used in a communication-aware mapping technique, when implemented on different parallel architectures. We compare the performance provided by a version of the local search method when executed on a single Graphics Processing Unit (GPU) with the one provided by the MPI version executed on a supercomputer with the same theoretical performance of the GPU platform, in order to study a fair scenario. We have considered a GPU based on the Fermi architecture, evaluating the improvements achieved by some new architectural features of this platform. The results show that a mixed parallel implementation on a single GPU outperforms the MPI implementation of the local search method. These results validate the GPU implementation as a very cost-effective accelerator for the local search method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On the Use of GPU for Accelerating Communication-Aware Mapping Techniques

Abstract

Talk to us

Similar Papers

More From: The Computer Journal

Lead the way for us

Journal: The Computer Journal	Publication Date: May 29, 2015
Citations: 3

Similar Papers

Reduction of computing time for seismic applications based on the Helmholtz equation by Graphics Processing Units

-

03 Mar 2015
03 Mar 2015

High Performance Graph Data Imputation on Multiple GPUs
Chao Zhou ... Tao Zhang
Future Internet | VOL. 13
Chao Zhou, et. al.Chao Zhou ... Tao Zhang
31 Jan 2021
Future Internet | VOL. 13

High-Performance Homomorphic Matrix Completion on Multiple GPUs
Tao Zhang ... Han Lu
IEEE Access | VOL. 8
Tao Zhang, et. al.Tao Zhang ... Han Lu
01 Jan 2020
IEEE Access | VOL. 8

Harnessing the power of idle GPUs for acceleration of biological sequence alignment
Fumihiko Ino ... Kenichi Hagihara
-
Fumihiko Ino, et. al.Fumihiko Ino ... Kenichi Hagihara
01 May 2009
01 May 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the Use of GPU for Accelerating Communication-Aware Mapping Techniques

Abstract

Talk to us

Similar Papers

More From: The Computer Journal