Optimizing point‐to‐point communication between adaptive MPI endpoints in shared memory

Sam White,Laxmikant V Kale

doi:10.1002/cpe.4467

Sam White, Laxmikant V Kale

Open Access

https://doi.org/10.1002/cpe.4467

Copy DOI

Abstract

SummaryAdaptive MPI is an implementation of the MPI standard that supports the virtualization of ranks as user‐level threads, rather than OS processes. In this work, we optimize the communication performance of AMPI based on the locality of the endpoints communicating within a cluster of SMP nodes. We differentiate between point‐to‐point messages with both endpoints co‐located on the same execution unit and point‐to‐point messages with both endpoints residing in the same process but not on the same execution unit. We demonstrate how the messaging semantics of Charm++ enable and hinder AMPI's implementation in different ways, and we motivate extensions to Charm++ to address the limitations. Using the OSU micro‐benchmark suite, we show that our locality‐aware design offers lower latency, higher bandwidth, and reduced memory footprint for applications.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Concurrency and Computation: Practice and Experience	Publication Date: Mar 12, 2018
Citations: 7	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Optimizing point‐to‐point communication between adaptive MPI endpoints in shared memory

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience

Lead the way for us

Similar Papers

Fine-grain software distributed shared memory on SMP clusters
D.J Scales ... K Gharachorloo
-
D.J Scales, et. al.D.J Scales ... K Gharachorloo
31 Jan 1998
31 Jan 1998

On Using an Hybrid MPI-Thread Programming for the Implementation of a Parallel Sparse Direct Solver on a Network of SMP Nodes
Pascal Hénon ... Pierre Ramet
-
Pascal Hénon, et. al.Pascal Hénon ... Pierre Ramet
01 Jan 2006
01 Jan 2006

Pipelined Scheduling of Tiled Nested Loops onto Clusters of SMPs Using Memory Mapped Network Interfaces
...
-
, et. al. ...
16 Nov 2002
16 Nov 2002

Pipelined Scheduling of Tiled Nested Loops onto Clusters of SMPs Using Memory Mapped Network Interfaces
M Athanasaki ... A Sotiropoulos
-
M Athanasaki, et. al.M Athanasaki ... A Sotiropoulos
01 Jan 2002
01 Jan 2002

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimizing point‐to‐point communication between adaptive MPI endpoints in shared memory

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience