SHMEMPMI -- Shared Memory Based PMI for Improved Performance and Scalability

Sourav Chakraborty,Dhabaleswar K Panda,Hari Subramoni,Jonathan Perkins

doi:10.1109/ccgrid.2016.99

Abstract

Dense systems with large number of cores per node are becoming increasingly popular. Existing designs of the Process Management Interface (PMI) show poor scalability in terms of performance and memory consumption on such systems with large number of processes concurrently accessing the PMI interface. Our analysis shows the local socket-based communication scheme used by PMI to be a major bottleneck. While using a shared memory based channel can avoid this bottleneck and thus reduce memory consumption and improve performance, there are several challenges associated with such a design. We investigate several such alternatives and propose a novel design that is based on a hybrid socket+shared memory based communication protocol and uses multiple shared memory regions. This design can reduce the memory usage per node by a factor of Processes per Node. Our evaluations show that memory consumption per node can be reduced by an estimated 1 GB with 1 million MPI processes and 16 processes per node. Additionally, performance of PMI Get is improved by 1,000 times compared to the existing design. The proposed design is backward compatible, secure, and imposes negligible overhead.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SHMEMPMI -- Shared Memory Based PMI for Improved Performance and Scalability

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A non-conformal domain decomposition method utilizing rotating subdomains and non-matching grids for periodic metamaterial simulation
Hangxin Liu ... Bingqi Liu
AIP Advances | VOL. 14
Hangxin Liu, et. al.Hangxin Liu ... Bingqi Liu
01 Apr 2024
AIP Advances | VOL. 14

Parameterized Splitting of Summed Volume Tables
Christian Reinbold ... Rüdiger Westermann
Computer Graphics Forum | VOL. 40
Christian Reinbold, et. al.Christian Reinbold ... Rüdiger Westermann
01 Jun 2021
Computer Graphics Forum | VOL. 40

A GCN-GRU Based End-to-End LEO Satellite Network Dynamic Topology Prediction Method
Yan Chen ... Daojin Chen
-
Yan Chen, et. al.Yan Chen ... Daojin Chen
01 Mar 2023
01 Mar 2023

3D Reconstruction Based on Cyclic Multi-View Stereo Network
Fangli Jia ... Yongheng Tang
-
Fangli Jia, et. al.Fangli Jia ... Yongheng Tang
01 Apr 2020
01 Apr 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SHMEMPMI -- Shared Memory Based PMI for Improved Performance and Scalability

Abstract

Talk to us

Similar Papers