High Performance and Enhanced Scalability for Parallel Applications using MPI-3’s non-blocking Collectives

Surendra Varma Pericherla, Sathish Vadhiyar

doi:10.1016/j.procs.2017.05.199

Open Access

https://doi.org/10.1016/j.procs.2017.05.199

Journal: Procedia Computer Science
Publication Date: Jan 1, 2017
License type: cc-by-nc-nd

Affiliation: Indian Institute of Science Bangalore

#Non-blocking Collective #Non-blocking Collectives #Cray Supercomputer #Scalability Bottlenecks #MPI Applications #Collective Communications #Machine Learning Applications #Graph Learning #High Performance #Strategies For Applications

Abstract

Collective communications account for 20-90% of total execution time in many MPI applications. In this paper, we propose strategies for automatically identifying the most time-consuming collective operations that also act as scalability bottlenecks. We then explore the use of MPI-3’s non-blocking collectives for these communications, and we restructure the code so that independent computations adequately overlap with the non-blocking collective communications. Applying these strategies to different graph and machine learning applications, we obtained performance improvements of up to 33% for large-scale runs on a Cray supercomputer.
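The abstract does not spell out how bottleneck collectives are selected, so the following is only an illustrative sketch of one plausible heuristic, not the authors' method. It assumes per-collective timings have already been gathered at two or more process counts, and flags collectives whose share of total runtime is significant at the largest scale and grows as processes are added. The names `CollectiveProfile`, `rank_bottlenecks`, the `min_share` threshold, and all timing numbers are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class CollectiveProfile:
    # Hypothetical label for a collective call site, e.g. "MPI_Allreduce".
    name: str
    # Process count -> seconds spent in this collective (assumed profiler output).
    seconds_by_procs: Dict[int, float]


def rank_bottlenecks(
    profiles: List[CollectiveProfile],
    total_seconds_by_procs: Dict[int, float],
    min_share: float = 0.05,
) -> List[Tuple[str, float, float]]:
    """Return (name, share at largest scale, growth in share) for collectives
    that are both time-consuming and take a growing fraction of runtime."""
    small = min(total_seconds_by_procs)  # smallest process count profiled
    large = max(total_seconds_by_procs)  # largest process count profiled
    ranked = []
    for p in profiles:
        share_small = p.seconds_by_procs[small] / total_seconds_by_procs[small]
        share_large = p.seconds_by_procs[large] / total_seconds_by_procs[large]
        # Keep only collectives that matter at scale AND whose share increases.
        if share_large >= min_share and share_large > share_small:
            ranked.append((p.name, share_large, share_large - share_small))
    # Most time-consuming at the largest scale first.
    ranked.sort(key=lambda r: r[1], reverse=True)
    return ranked


# Made-up timings for illustration only.
totals = {64: 100.0, 1024: 20.0}
profiles = [
    CollectiveProfile("MPI_Allreduce", {64: 10.0, 1024: 8.0}),
    CollectiveProfile("MPI_Bcast", {64: 5.0, 1024: 0.5}),
    CollectiveProfile("MPI_Alltoall", {64: 20.0, 1024: 3.0}),
]
candidates = rank_bottlenecks(profiles, totals)
for name, share, growth in candidates:
    print(f"{name}: {share:.0%} of runtime at scale, share grew by {growth:.0%}")
```

With these invented numbers only `MPI_Allreduce` is flagged: its share rises from 10% to 40% of runtime, whereas the other two collectives shrink as a fraction of the total. Collectives flagged this way would be the natural candidates for replacement with their non-blocking MPI-3 counterparts (for example `MPI_Iallreduce` followed by `MPI_Wait`), with independent computation moved between the start and the wait.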
