Abstract

Lattice quantum chromodynamics (QCD) calculations were one of the first applications to demonstrate the potential of GPUs in the area of high-performance computing; the nature of lattice QCD calculations matches well with the GPU's computational model. This article discusses ways to effectively use GPUs for lattice calculations using the overlap operator, a discretization that preserves chiral symmetry even at nonzero lattice spacing and makes possible lattice QCD simulations in the parameter region relevant to nuclear physics. The author shows that the large memory footprint of these codes requires the use of multiple GPUs in parallel and discusses methods for implementing this operator efficiently: hybrid CPU/GPU memory use for eigensolvers and MPI/OpenMP/CUDA parallelization strategies required to take full advantage of both GPU and CPU resources. He then compares the performance of codes on a GPU cluster and a CPU cluster with similar interconnects, discussing the strong scaling for problem sizes relevant to current lattice QCD simulations.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call