Optimized FFT computations on heterogeneous platforms with application to the Poisson equation

Jing Wu,Joseph Jaja

doi:10.1016/j.jpdc.2014.03.009

Abstract

We develop optimized multi-dimensional FFT implementations on CPU–GPU heterogeneous platforms for the case when the input is too large to fit on the GPU global memory, and use the resulting techniques to develop a fast Poisson solver. The solver involves memory bound computations for which the large 3D data may have to be transferred over the PCIe bus several times during the computation. We develop a new strategy to decompose and allocate the computation between the GPU and the CPU such that the 3D data is transferred only once to the device memory, and the executions of the GPU kernels are almost completely overlapped with the PCI data transfer. We were able to achieve significantly better performance than what has been reported in previous related work, including over 145 GFLOPS for the three periodic boundary conditions (single precision version), and over 105 GFLOPS for the two periodic, one Neumann boundary conditions (single precision version). The effective bidirectional PCIe bus bandwidth achieved is 9–10 GB/s, which is close to the best possible on our platform. For all the cases tested, the single 3D data PCIe transfer time, which constitutes a lower bound on what is possible on our platform, takes almost 70% of the total execution time of the Poisson solver.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimized FFT computations on heterogeneous platforms with application to the Poisson equation

Abstract

Talk to us

Similar Papers

More From: Journal of Parallel and Distributed Computing

Lead the way for us

Journal: Journal of Parallel and Distributed Computing	Publication Date: Mar 28, 2014
Citations: 25

Similar Papers

High Performance FFT Based Poisson Solver on a CPU-GPU Heterogeneous Platform
Jing Wu ... Joseph Jaja
-
Jing Wu, et. al.Jing Wu ... Joseph Jaja
01 May 2013
01 May 2013

Existence and multiplicity results for some nonlinear problems with singular ϕ-Laplacian
C Bereanu ... J Mawhin
Journal of Differential Equations | VOL. 243
C Bereanu, et. al.C Bereanu ... J Mawhin
04 Jun 2007
Journal of Differential Equations | VOL. 243

Noise-induced, ac-stabilized sine-Gordon breathers: Emergence and statistics
Duilio De Santis ... Davide Valenti
Communications in Nonlinear Science and Numerical Simulation | VOL. 131
Duilio De Santis, et. al.Duilio De Santis ... Davide Valenti
22 Dec 2023
Communications in Nonlinear Science and Numerical Simulation | VOL. 131

Pseudospectral Time-Domain (PSTD) Methods for the Wave Equation: Realizing Boundary Conditions with Discrete Sine and Cosine Transforms
Elliott S Wise ... Bradley E Treeby
Journal of Theoretical and Computational Acoustics | VOL. 29
Elliott S Wise, et. al.Elliott S Wise ... Bradley E Treeby
19 Oct 2020
Journal of Theoretical and Computational Acoustics | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimized FFT computations on heterogeneous platforms with application to the Poisson equation

Abstract

Talk to us

Similar Papers

More From: Journal of Parallel and Distributed Computing