Abstract
We present results for the world's first lattice QCD quark solver to exceed 100 PFLOPS in single precision, running on Fugaku, Japan's new supercomputer. On the same problem size, a 192^4 lattice, we achieve a factor of 38 speedup in time over the K computer, sustaining 102 PFLOPS, which is 10% of the single-precision floating-point peak. The evaluated region is the single-precision BiCGStab solver for a Clover–Wilson Dirac matrix, preconditioned by Schwarz Alternating Procedure domain decomposition with Jacobi iteration for the local domain matrix inversion.
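As a rough consistency check on the figures quoted above, the short sketch below computes the number of lattice sites for a four-dimensional lattice of extent 192 and the single-precision peak implied by 102 PFLOPS at 10% efficiency. The variable names are illustrative, and because the 10% figure is rounded, the implied peak is only approximate.

```python
# Figures quoted in the abstract; variable names are illustrative only.
lattice_extent = 192        # sites per dimension
dimensions = 4              # four-dimensional space-time lattice
sustained_pflops = 102.0    # sustained single-precision performance
efficiency = 0.10           # fraction of single-precision peak (rounded)

# Total number of lattice sites: 192^4.
lattice_sites = lattice_extent ** dimensions

# Peak implied by the sustained rate and the quoted (rounded) efficiency.
implied_peak_pflops = sustained_pflops / efficiency

print(f"lattice sites: {lattice_sites:,}")
print(f"implied single-precision peak: about {implied_peak_pflops:.0f} PFLOPS")
```

The lattice therefore has about 1.36 billion sites, and the quoted efficiency corresponds to a single-precision peak of roughly 1 EFLOPS.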