Abstract

The best hope for reducing long-standing global climate model biases is by increasing resolution to the kilometer scale. Here we present results from an ultrahigh-resolution non-hydrostatic climate model for a near-global setup running on the full Piz Daint supercomputer on 4888 GPUs (graphics processing units). The dynamical core of the model has been completely rewritten using a domain-specific language (DSL) for performance portability across different hardware architectures. Physical parameterizations and diagnostics have been ported using compiler directives. To our knowledge this represents the first complete atmospheric model run entirely on accelerators at this scale. At a grid spacing of 930 m (1.9 km), we achieve a simulation throughput of 0.043 (0.23) simulated years per day and an energy consumption of 596 MWh per simulated year. Furthermore, we propose a new memory usage efficiency (MUE) metric that considers how efficiently the memory bandwidth – the dominant bottleneck of climate codes – is being used.
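The throughput and energy figures quoted above imply a wall-clock turnaround time and a mean power draw. The following is a hedged back-of-the-envelope check of those numbers (our own arithmetic, not the authors' calculation):

```python
# Back-of-the-envelope check of the figures in the abstract:
# 0.043 simulated years per day (SYPD) at 930 m grid spacing,
# and 596 MWh of energy per simulated year.

def days_per_simulated_year(sypd: float) -> float:
    """Wall-clock days needed to advance the simulation by one year."""
    return 1.0 / sypd

def mean_power_mw(energy_mwh_per_sy: float, sypd: float) -> float:
    """Average machine power (MW) implied by the energy-to-solution figure."""
    hours_per_simulated_year = 24.0 * days_per_simulated_year(sypd)
    return energy_mwh_per_sy / hours_per_simulated_year

print(f"{days_per_simulated_year(0.043):.1f} wall-clock days per simulated year")
print(f"{mean_power_mw(596.0, 0.043):.2f} MW mean power draw")
```

At 0.043 SYPD, one simulated year takes roughly 23 wall-clock days, and 596 MWh over that period corresponds to a mean draw of about 1.1 MW, a plausible figure for the GPU partition of a large Cray XC50 system.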

Highlights

  • Should global warming occur at the upper end of the range of current projections, the local impacts of unmitigated climate change would be dramatic

  • To enable running the Consortium for Small-Scale Modeling (COSMO) model on hybrid high-performance computing systems with GPU-accelerated compute nodes, we rewrote the dynamical core of the model, which implements the solution of the non-hydrostatic Euler equations, from Fortran to C++ (Fuhrer et al., 2014)

  • Our implementation of the COSMO model that is used for production-level numerical weather predictions at MeteoSwiss has been scaled to the full system on 4888 nodes of Piz Daint, a GPU-accelerated Cray XC50 supercomputer at the Swiss National Supercomputing Centre (CSCS)

Summary

Introduction

Should global warming occur at the upper end of the range of current projections, the local impacts of unmitigated climate change would be dramatic.

The Model for Prediction Across Scales (MPAS) later, in 2015, participated in the Next Generation Global Prediction System (NGGPS) model intercomparison project (Michalakes et al., 2015) at the same resolution and achieved 0.16 SYPD on the full National Energy Research Scientific Computing Center (NERSC) Edison system. Yang et al. (2016a) use an acoustic Courant number of up to 177; i.e., their time step is 177 times larger than in a standard explicit integration (this estimate is based on the Δx = 488 m simulation with Δt = 240 s). In their case, such a large time step may be chosen, as sound propagation is not relevant for weather phenomena. Since the IFS model is a hydrostatic model, we conclude that even for fully implicit, global, convection-resolving climate simulations at ∼ 1–2 km grid spacing, a time step larger than 40–60 s cannot be considered a viable option. The main advantage of this approach is that it exhibits – at least in theory – perfect weak scaling. This applies to the communication load per sub-domain when applying horizontal domain decomposition.
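The acoustic Courant number cited above can be reproduced from the stated grid spacing and time step. The sketch below assumes a sound speed of about 360 m/s (a warm-atmosphere value; the paper does not state which value it used):

```python
# Acoustic Courant number C = c_s * dt / dx: the number of grid cells a
# sound wave crosses in one time step. A standard explicit scheme requires
# C <= 1, so C also measures how much larger an implicit time step is.
# The sound speed c_s ~ 360 m/s is an assumption; dx = 488 m and
# dt = 240 s are the values quoted in the text.

def acoustic_courant(c_s: float, dt: float, dx: float) -> float:
    """Courant number for acoustic waves at speed c_s on grid spacing dx."""
    return c_s * dt / dx

print(round(acoustic_courant(360.0, 240.0, 488.0)))  # ≈ 177
```

This recovers the factor of 177 quoted in the text, i.e., a time step 177 times larger than an explicit integration would permit.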

Model description
Hardware description
Energy measurements
Simulation setup and verification
Efficiency metric
Weak scalability
Strong scalability
Time to solution
Energy to solution
Simulation efficiency
Conclusions
Necessary transfers Q
COSMO CDAG
Findings
Maximum achievable bandwidth B
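The section headings above ("Efficiency metric", "Necessary transfers Q", "Maximum achievable bandwidth B") suggest how the MUE metric from the abstract is assembled. The exact definition is given in the paper; the sketch below assumes one plausible form, in which MUE multiplies a data-volume efficiency (necessary versus actual transfers) by a bandwidth efficiency (achieved versus maximum achievable bandwidth):

```python
# Hedged sketch of a memory-usage-efficiency (MUE) style metric.
# Assumed form (not necessarily the paper's exact definition):
#   MUE = (Q_necessary / Q_actual) * (B_achieved / B_max)
# MUE = 1 would mean every byte moved was necessary and was moved
# at the maximum achievable memory bandwidth.

def mue(q_necessary: float, q_actual: float,
        b_achieved: float, b_max: float) -> float:
    """Combine data-volume efficiency with bandwidth efficiency."""
    return (q_necessary / q_actual) * (b_achieved / b_max)

# Illustrative, made-up numbers: 80 GB of necessary transfers vs 100 GB
# actually moved, and 400 GB/s achieved vs 500 GB/s attainable on the node.
print(f"MUE = {mue(80.0, 100.0, 400.0, 500.0):.2f}")
```

A metric of this shape captures the abstract's point that memory bandwidth, not floating-point throughput, is the dominant bottleneck of climate codes: it penalizes both redundant data movement and underutilized bandwidth.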
