Abstract

In this study, we identify the key message passing interface (MPI) operations required in atmospheric modelling and then use a skeleton program and a simulation framework (based on the SST/macro simulation package) to simulate these MPI operations (transposition, halo exchange, and allreduce), with future exascale machines in mind. The experimental results show that the choice of collective algorithm has a great impact on communication performance; in particular, we find that the generalized ring-k algorithm for the alltoallv operation and the generalized recursive-k algorithm for the allreduce operation perform best. In addition, we observe that the interconnect topology and the routing algorithm have a significant impact on the performance and scalability of the transposition, halo exchange, and allreduce operations, although the routing algorithm has a negligible impact on the allreduce operation because of its small message size. Since hardware limitations prevent bandwidth from growing and latency from shrinking indefinitely, congestion may occur and limit further improvement of communication performance. The experiments show that communication performance can be improved when congestion is mitigated by a proper configuration of the topology and routing algorithm, which distributes the congestion uniformly over the interconnect network and avoids the hotspots and bottlenecks that congestion would otherwise cause. It is generally believed that transpositions seriously limit the scalability of spectral models. The experiments show that, below 2×10⁵ MPI processes, the communication time of the transposition is larger than those of the wide halo exchange used by the semi-Lagrangian method and of the allreduce in the generalized conjugate residual (GCR) iterative solver used by the semi-implicit method. The communication time of the transposition decreases quickly with an increasing number of MPI processes, demonstrating good strong scalability for very large grids and moderate latencies. The communication time of the halo exchange decreases more slowly, revealing weaker scalability. In contrast, the communication time of the allreduce increases with the number of MPI processes, so it does not scale well. From this point of view, the scalability of spectral models could still be acceptable. It therefore seems premature to conclude that grid-point models scale better than spectral models at the exascale, unless innovative methods are exploited to mitigate the scalability problems present in grid-point models.
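
A minimal sketch (not the authors' skeleton program) of the three MPI operations named above: an MPI_Alltoallv transposition, a nearest-neighbour halo exchange, and a small MPI_Allreduce such as a GCR-type solver would use. All buffer and halo sizes are placeholder assumptions.

    /* Minimal sketch of the three communication patterns; sizes are placeholders. */
    #include <mpi.h>
    #include <stdlib.h>

    int main(int argc, char **argv)
    {
        int rank, size;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);
        MPI_Comm_size(MPI_COMM_WORLD, &size);

        /* 1. Transposition: every rank exchanges one block with every other rank. */
        int blk = 1024;                                   /* placeholder block size */
        double *sbuf = calloc((size_t)size * blk, sizeof(double));
        double *rbuf = calloc((size_t)size * blk, sizeof(double));
        int *cnt = malloc(size * sizeof(int));
        int *dsp = malloc(size * sizeof(int));
        for (int i = 0; i < size; ++i) { cnt[i] = blk; dsp[i] = i * blk; }
        MPI_Alltoallv(sbuf, cnt, dsp, MPI_DOUBLE,
                      rbuf, cnt, dsp, MPI_DOUBLE, MPI_COMM_WORLD);

        /* 2. Halo exchange: swap a halo with the neighbours of a periodic
         *    1-D decomposition (a wide halo simply uses a larger count). */
        int halo = 256;                                   /* placeholder halo width */
        double *hs = calloc(halo, sizeof(double));
        double *hr = calloc(halo, sizeof(double));
        int left  = (rank - 1 + size) % size;
        int right = (rank + 1) % size;
        MPI_Sendrecv(hs, halo, MPI_DOUBLE, right, 0,
                     hr, halo, MPI_DOUBLE, left,  0,
                     MPI_COMM_WORLD, MPI_STATUS_IGNORE);

        /* 3. Allreduce: a small global reduction (e.g. a dot product in GCR). */
        double local = 1.0, global = 0.0;
        MPI_Allreduce(&local, &global, 1, MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);

        MPI_Finalize();
        return 0;
    }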

Highlights

  • Current high-performance computing (HPC) systems have thousands of nodes and millions of cores

  • What would the performance of a global numerical weather prediction (NWP) model with very high resolution be on an exascale HPC system? In this paper, we are especially interested in the strong scaling of an atmospheric model, that is, how the model behaves at a fixed resolution as the number of processes increases. The strong scaling of the three key message passing interface (MPI) operations in the atmospheric model is assessed for 10², 2×10², ..., 9×10², 10³, 2×10³, ..., 9×10³, 10⁴, 2×10⁴, ..., 9×10⁴, 10⁵, 2×10⁵, ..., 9×10⁵, and 10⁶ MPI tasks; the maximum is 2×10⁵ processes for the MPI transposition owing to the hard time limit on our cluster (a minimal timing sketch follows this list)

  • Besides the topology and its configuration, the routing algorithm, and the collective MPI algorithm, the bandwidth and latency of the interconnect network of an HPC system have a great impact on communication performance
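
A minimal sketch of how one strong-scaling data point could be timed at a given MPI task count; the timed operation (a one-element MPI_Allreduce) and the repetition count are illustrative assumptions, not the study's actual measurement harness.

    /* Sketch: time a collective at the current task count; parameters are placeholders. */
    #include <mpi.h>
    #include <stdio.h>

    int main(int argc, char **argv)
    {
        int rank;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);

        double local = 1.0, global = 0.0, tmax = 0.0;
        const int reps = 100;                       /* placeholder repetition count */

        MPI_Barrier(MPI_COMM_WORLD);                /* start all ranks together */
        double t0 = MPI_Wtime();
        for (int i = 0; i < reps; ++i)
            MPI_Allreduce(&local, &global, 1, MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);
        double t = (MPI_Wtime() - t0) / reps;

        /* The slowest rank bounds the communication time. */
        MPI_Reduce(&t, &tmax, 1, MPI_DOUBLE, MPI_MAX, 0, MPI_COMM_WORLD);
        if (rank == 0)
            printf("allreduce time per call: %g s\n", tmax);

        MPI_Finalize();
        return 0;
    }

Repeating the same fixed-size experiment at each task count of the sweep above yields the strong-scaling curve for that operation.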

Summary

Introduction

Current high-performance computing (HPC) systems have thousands of nodes and millions of cores. It is envisaged that an exascale HPC system will have millions of nodes and thousands of cores per node, with a peak performance of at least one exaflop. Exascale HPC poses several challenges in terms of power consumption, performance, scalability, programmability, and resilience. The interconnect network of an exascale HPC system becomes larger and more complex, and its performance, which largely determines the overall performance of the HPC system, is crucial to the performance of distributed applications. Designing energy-efficient, cost-scalable interconnect networks and communication-efficient, scalable distributed applications is an important component of HPC hardware/software co-design to address these challenges. Evaluating and predicting the communication behaviour of distributed applications is therefore necessary, and it is only feasible by modelling the communications and the underlying interconnect network, especially for future supercomputers.

