Optimization of MPI-Process Mapping for Clusters with Angara Interconnect

M R Khalilov, A V Timofeev

doi:10.1134/s1995080218090111

Abstract

An algorithm for optimizing the mapping of MPI processes is adapted for supercomputers with the Angara interconnect. The mapping algorithm is based on partitioning the communication pattern of the parallel program, so that the processes that exchange data most intensively are bound to the nodes/processors linked by the highest bandwidth. The algorithm finds a near-optimal distribution of the program's processes across processor cores that minimizes the total execution time of exchanges between MPI processes. Results of optimized process placement with the proposed method on small supercomputers are analyzed, as is the dependence of MPI program execution time on supercomputer parameters and task parameters. A theoretical model is proposed for estimating the effect of mapping optimization on execution time for several types of supercomputer topologies. The prospects for using the implemented optimization library on large-scale supercomputers with the Angara interconnect are discussed.
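The idea of placing heavily communicating processes close together can be illustrated with a toy example. The sketch below is not the authors' algorithm or library. It is a simple greedy grouping heuristic chosen for illustration, and the names `mapping_cost`, `greedy_mapping`, `traffic`, and `distance` are hypothetical. It assumes a symmetric traffic matrix between processes and a node-to-node distance matrix that stands in for inverse bandwidth (0 within a node, 1 between nodes).

```python
from itertools import combinations


def mapping_cost(traffic, distance, mapping):
    """Sum traffic[i][j] * distance[node_i][node_j] over all process pairs."""
    n = len(traffic)
    return sum(
        traffic[i][j] * distance[mapping[i]][mapping[j]]
        for i, j in combinations(range(n), 2)
    )


def greedy_mapping(traffic, cores_per_node):
    """Toy heuristic: fill each node with processes that exchange the most data."""
    unassigned = set(range(len(traffic)))
    mapping = [None] * len(traffic)
    node = 0
    while unassigned:
        # Seed the node with the process that has the most traffic to the
        # remaining unassigned processes (ties go to the lowest rank).
        seed = max(
            unassigned,
            key=lambda p: (sum(traffic[p][q] for q in unassigned), -p),
        )
        group = [seed]
        unassigned.remove(seed)
        # Add the processes that communicate most with the current group.
        while len(group) < cores_per_node and unassigned:
            best = max(
                unassigned,
                key=lambda p: (sum(traffic[p][g] for g in group), -p),
            )
            group.append(best)
            unassigned.remove(best)
        for p in group:
            mapping[p] = node
        node += 1
    return mapping


# Hypothetical input: ranks 0 and 2 exchange heavily, as do ranks 1 and 3.
traffic = [
    [0, 1, 10, 1],
    [1, 0, 1, 10],
    [10, 1, 0, 1],
    [1, 10, 1, 0],
]
distance = [[0, 1], [1, 0]]  # assumed: intra-node cost 0, inter-node cost 1

block = [0, 0, 1, 1]                  # default block placement by rank
optimized = greedy_mapping(traffic, cores_per_node=2)

print(mapping_cost(traffic, distance, block))      # 22
print(optimized)                                   # [0, 1, 0, 1]
print(mapping_cost(traffic, distance, optimized))  # 4
```

In this made-up case the greedy placement puts each heavily communicating pair on one node, so only the light exchanges cross the network. A real mapping tool would also need to account for the interconnect topology and for measured communication patterns, which this sketch ignores.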

Journal: Lobachevskii Journal of Mathematics
Publication Date: Nov 1, 2018
Citations: 8

