AN MPI PERFORMANCE MONITORING INTERFACE FOR CELL BASED COMPUTE NODES

Hikmet Dursun,Kevin J Barker,Aiichiro Nakano,Rajiv K Kalia,Scott Pakin,Richard Seymour,Darren J Kerbyson,Priya Vashishta

doi:10.1142/s0129626409000407

Abstract

In this paper, we present a methodology for profiling parallel applications executing on the family of architectures commonly referred as the "Cell" processor. Specifically, we examine Cell-centric MPI programs on hybrid clusters containing multiple Opteron and IBM PowerXCell 8i processors per node such as those used in the petascale Roadrunner system. We analyze the performance of our approach on a PlayStation3 console based on Cell Broadband Engine—the CBE—as well as an IBM BladeCenter QS22 based on PowerXCell 8i. Our implementation incurs less than 0.5% overhead and 0.3 µs per profiler call for a typical molecular dynamics code on the Cell BE while efficiently utilizing the limited local store of the Cell's SPE cores. Our worst-case overhead analysis on the PowerXCell 8i costs 3.2 µs per profiler call while using only two 5 KiB buffers. We demonstrate the use of our profiler on a cluster of hybrid nodes running a suite of scientific applications. Our analyses of inter-SPE communication (across the entire cluster) and function call patterns provide valuable information that can be used to optimize application performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AN MPI PERFORMANCE MONITORING INTERFACE FOR CELL BASED COMPUTE NODES

Abstract

Talk to us

Similar Papers

More From: Parallel Processing Letters

Lead the way for us

Journal: Parallel Processing Letters	Publication Date: Dec 1, 2009
Citations: 9

Similar Papers

Application profiling on Cell-based clusters
Hikmet Dursun ... Darren J Kerbyson
-
Hikmet Dursun, et. al.Hikmet Dursun ... Darren J Kerbyson
01 May 2009
01 May 2009

QPACE: Quantum Chromodynamics Parallel Computing on the Cell Broadband Engine
...
Computing in Science & Engineering | VOL. 10
, et. al. ...
01 Nov 2008
Computing in Science & Engineering | VOL. 10

Efficient SIMDization and Data Management of the Lattice QCD Computation on the Cell Broadband Engine
Khaled Z Ibrahim ... François Bodin
Scientific Programming | VOL. 17
Khaled Z Ibrahim, et. al.Khaled Z Ibrahim ... François Bodin
01 Jan 2009
Scientific Programming | VOL. 17

IBM BladeCenter QS22: Design, performance, and utilization in hybrid computing systems
J.-S Vogt ... H Boettiger
IBM Journal of Research and Development | VOL. 53
J.-S Vogt, et. al.J.-S Vogt ... H Boettiger
01 Sep 2009
IBM Journal of Research and Development | VOL. 53

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AN MPI PERFORMANCE MONITORING INTERFACE FOR CELL BASED COMPUTE NODES

Abstract

Talk to us

Similar Papers

More From: Parallel Processing Letters