Hierarchical redesign of classic MPI reduction algorithms

Khalid Hasanov, Alexey Lastovetsky

doi:10.1007/s11227-016-1779-7

Abstract

Optimization of MPI collective communication operations has been an active research topic since the advent of MPI in the 1990s. Many general and architecture-specific collective algorithms have been proposed and implemented in state-of-the-art MPI implementations. Hierarchical topology-oblivious transformation of existing communication algorithms has recently been proposed as a promising new approach to optimizing MPI collective communication algorithms and MPI-based applications. This approach has been successfully applied to the most popular parallel matrix multiplication algorithm, SUMMA, and to state-of-the-art MPI broadcast algorithms, demonstrating significant multifold performance gains, especially on large-scale HPC systems. In this paper, we apply this approach to the optimization of the MPI Reduce and Allreduce operations. Theoretical analysis and experimental results on a cluster of the Grid’5000 platform are presented.
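The abstract does not spell out the transformation, so the following is only a minimal sketch of one plausible reading: the processes are split into groups, a reduction runs inside each group, and a second reduction combines the group results. It simulates this in plain Python with no MPI calls. The function names, the contiguous grouping, and the `num_groups` parameter are assumptions made for illustration, not the authors' implementation.

```python
from functools import reduce
import operator


def flat_reduce(values, op):
    """Reduce one value per simulated process in a single flat step."""
    return reduce(op, values)


def hierarchical_reduce(values, op, num_groups):
    """Two-level reduction: inside each group, then across group results.

    Assumption: ranks 0..p-1 are split into contiguous groups of
    near-equal size, and `op` is associative.
    """
    p = len(values)
    if not 1 <= num_groups <= p:
        raise ValueError("num_groups must be between 1 and the number of processes")
    base, extra = divmod(p, num_groups)
    partials, start = [], 0
    for g in range(num_groups):
        size = base + (1 if g < extra else 0)
        # Phase 1: reduce within the group; the result sits at the group leader.
        partials.append(reduce(op, values[start:start + size]))
        start += size
    # Phase 2: reduce the leaders' partial results at the root.
    return reduce(op, partials)


values = list(range(1, 17))  # one contribution per simulated process
assert hierarchical_reduce(values, operator.add, 4) == flat_reduce(values, operator.add)
```

Because the groups are contiguous, the result matches the flat reduction for any associative operation, including non-commutative ones. In a real MPI program the two phases would presumably run on sub-communicators (for example, ones created with `MPI_Comm_split`), but that mapping is an assumption here.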

Journal: The Journal of Supercomputing
Publication Date: Jun 18, 2016
Citations: 27

Similar Papers

Hierarchical Optimization of MPI Reduce Algorithms
Khalid Hasanov ... Alexey Lastovetsky
01 Jan 2015

Recent advances in the Message Passing Interface
Javier Garcia Blas ... Jesus Carretero
The International Journal of High Performance Computing Applications | VOL. 28
01 Nov 2014

Automatically Tuned Collective Communications
...
01 Nov 2000

Accelerating Allreduce Operation: A Switch-Based Solution
Nongda Hu ... Ninghui Sun
01 Jul 2013
