High Performance Stencil Code Algorithms for GPGPUs

Andreas Schäfer,Dietmar Fey

doi:10.1016/j.procs.2011.04.221

Abstract

In this paper we investigate how stencil computations can be implemented on state-of-the-art general purpose graphics processing units (GPGPUs). Stencil codes can be found at the core of many numerical solvers and physical simulation codes and are therefore of particular interest to scientific computing research. GPGPUs have gained a lot of attention recently because of their superior floating point performance and memory bandwidth. Nevertheless, especially memory bound stencil codes have proven to be challenging for GPGPUs, yielding lower than to be expected speedups. We chose the Jacobi method as a standard benchmark to evaluate a set of algorithms on NVIDIA's latest Fermi chipset. One of our fastest algorithms is a parallel wavefront update. It exploits the enlarged on-chip shared memory to perform two time step updates per sweep. To the best of our knowledge, it represents the first successful applicationof temporal blocking for 3D stencils on GPGPUs and thereby exceeds previous results by a considerable margin. It is also the first paper to study stencil codes on Fermi.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

High Performance Stencil Code Algorithms for GPGPUs

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science

Lead the way for us

Journal: Procedia Computer Science	Publication Date: Jan 1, 2011
Citations: 59

Similar Papers

Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures. - eScholarship
...
-
, et. al. ...
01 Jan 2008
01 Jan 2008

CUDA usage in electrodynamics and mechatronics
K Mrowca
-
K MrowcaK Mrowca
01 Oct 2011
01 Oct 2011

Supporting Preemptive Task Executions and Memory Copies in GPGPUs
Can Basaran ... Kyoung-Don Kang
-
Can Basaran, et. al.Can Basaran ... Kyoung-Don Kang
01 Jul 2012
01 Jul 2012

The Impact of Asynchronous GPGPU Behaviors on Stochastic Simulation
John C Steuben ... Cameron J Turner
-
John C Steuben, et. al.John C Steuben ... Cameron J Turner
04 Aug 2013
04 Aug 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

High Performance Stencil Code Algorithms for GPGPUs

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science