SU (2) lattice gauge theory simulations on Fermi GPUs

Nuno Cardoso,Pedro Bicudo

doi:10.1016/j.jcp.2011.02.023

Abstract

In this work we explore the performance of CUDA in quenched lattice SU (2) simulations. CUDA, NVIDIA Compute Unified Device Architecture, is a hardware and software architecture developed by NVIDIA for computing on the GPU. We present an analysis and performance comparison between the GPU and CPU in single and double precision. Analyses with multiple GPUs and two different architectures (G200 and Fermi architectures) are also presented. In order to obtain a high performance, the code must be optimized for the GPU architecture, i.e., an implementation that exploits the memory hierarchy of the CUDA programming model. We produce codes for the Monte Carlo generation of SU (2) lattice gauge configurations, for the mean plaquette, for the Polyakov Loop at finite T and for the Wilson loop. We also present results for the potential using many configurations (50,000) without smearing and almost 2000 configurations with APE smearing. With two Fermi GPUs we have achieved an excellent performance of 200× the speed over one CPU, in single precision, around 110 Gflops/s. We also find that, using the Fermi architecture, double precision computations for the static quark-antiquark potential are not much slower (less than 2× slower) than single precision computations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SU (2) lattice gauge theory simulations on Fermi GPUs

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Physics

Lead the way for us

Journal: Journal of Computational Physics	Publication Date: Feb 20, 2011
Citations: 22

Similar Papers

CFD Computations Using Preconditioned Krylov Solver on GPUs
Amit Amritkar ... Danesh Tafti
-
Amit Amritkar, et. al.Amit Amritkar ... Danesh Tafti
03 Aug 2014
03 Aug 2014

Efficient magnetohydrodynamic simulations on graphics processing units with CUDA
Hon-Cheng Wong ... Zesheng Tang
Computer Physics Communications | VOL. 182
Hon-Cheng Wong, et. al.Hon-Cheng Wong ... Zesheng Tang
18 May 2011
Computer Physics Communications | VOL. 182

The Impact of Multicore on Math Software and Exploiting Single Precision Computing to Obtain Double Precision Results
Jack Dongarra
-
Jack DongarraJack Dongarra
01 Jan 2006
01 Jan 2006

Emerging Architectures Enable to Boost Massively Parallel Data Mining Using Adaptive Sparse Grids
Alexander Heinecke ... Dirk Pflüger
International Journal of Parallel Programming | VOL. 41
Alexander Heinecke, et. al.Alexander Heinecke ... Dirk Pflüger
03 Jul 2012
International Journal of Parallel Programming | VOL. 41

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SU (2) lattice gauge theory simulations on Fermi GPUs

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Physics