Performance Optimization of Massively Parallel FDTD Computations

Abdulaziz R Alazmi, Abdulrahman R Alazmi

doi:10.4156/rnis.vol11.issue0.7

Abstract

Advances in general-purpose graphics processing units (GPGPUs) have made it possible to run computationally intensive scientific calculations on off-the-shelf, massively parallel graphics processors (GPUs) rather than on expensive solutions such as high-performance clusters (HPCs) or supercomputers. In this paper, NVIDIA's Compute Unified Device Architecture (CUDA) on an NVIDIA GeForce processor is used to solve a finite-difference time-domain (FDTD) computation. An FDTD computation has an infeasible running time on a single processor. Attempts have been made to use multi-core CPUs and HPCs, but they incur high overhead from network traffic in HPCs or from synchronization among cores. Several attempts have also been made to use GPUs for FDTD computations; this paper focuses on optimizing the efficiency of the algorithm by maximizing throughput through effective use of the fast on-chip shared memory and by avoiding the slow off-chip global memory.
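The abstract does not show the authors' kernel, so the following is only a minimal CPU-side sketch of the access pattern that shared-memory FDTD schemes generally rely on: each tile of the grid is staged once, together with one neighboring "halo" cell, into a small local buffer, and the stencil then reads only from that buffer. The 1D Yee-style update, the function names (`step_reference`, `step_tiled`, `run`), the coefficient `c`, the tile size, and the Gaussian source are all assumptions made for illustration. In a CUDA kernel, the local list would correspond to a `__shared__` array filled by one thread block.

```python
import math


def step_reference(ez, hy, c=0.5):
    """One leapfrog step of a 1D Yee-style update, reading the full arrays directly."""
    n = len(ez)
    for i in range(n - 1):
        hy[i] += c * (ez[i + 1] - ez[i])
    for i in range(1, n):
        ez[i] += c * (hy[i] - hy[i - 1])


def step_tiled(ez, hy, c=0.5, tile=8):
    """Same update, but each tile first stages its inputs (plus one halo cell)
    into a local buffer. The buffer stands in for on-chip shared memory;
    the full lists stand in for off-chip global memory."""
    n = len(ez)
    # H update: cells start..stop-1 need ez[start..stop] (halo on the right).
    for start in range(0, n - 1, tile):
        stop = min(start + tile, n - 1)
        local_ez = ez[start:stop + 1]  # one staged read per tile
        for k in range(stop - start):
            hy[start + k] += c * (local_ez[k + 1] - local_ez[k])
    # E update: cells start..stop-1 need hy[start-1..stop-1] (halo on the left).
    for start in range(1, n, tile):
        stop = min(start + tile, n)
        local_hy = hy[start - 1:stop]  # one staged read per tile
        for k in range(stop - start):
            ez[start + k] += c * (local_hy[k + 1] - local_hy[k])


def run(step, n=50, steps=40):
    """Drive a step function with a soft Gaussian source at the grid center."""
    ez = [0.0] * n
    hy = [0.0] * n
    for t in range(steps):
        ez[n // 2] += math.exp(-((t - 15) / 5.0) ** 2)
        step(ez, hy)
    return ez, hy
```

Calling `run(step_reference)` and `run(step_tiled)` should give the same fields, since the tiled version performs the same arithmetic and only changes where the operands are read from. On a GPU, the benefit would come from each staged value being reused by neighboring threads instead of being fetched from global memory more than once.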

Journal: Research Notes in Information Science
Publication Date: Jan 31, 2013
Citations: 12

Similar Papers

Analysis of performance enhancement on graphic processor based heterogeneous architecture: A CUDA and MATLAB experiment
Vilas H Naik ... Chidanand S Kusur
01 Feb 2015

Reducing off-chip memory traffic by selective cache management scheme in GPGPUs
Hyojin Choi ... Jaewoo Ahn
03 Mar 2012

Stream Processing of a Neural Classifier II
M Martínez-Zarzuela ... D González Ortega
01 Jan 2009

Neuromorphic models on a GPGPU cluster
Bing Han ... Tarek M Taha
01 Jul 2010
