A comparison of optimal FFTs on torus and hypercube multicomputers

Paul N Swarztrauber, Steven W Hammond

doi:10.1016/s0167-8191(00)00107-1

Abstract

In this paper we compare the optimum performance of the fast Fourier transform (FFT) on torus and hypercube multicomputers. The optimal number of floating-point operations is architecturally invariant, so relative performance is determined by communication time. We compare known multiport communication algorithms that use the full bandwidth of the interconnection network on every cycle and route data unblocked over minimum-length paths. Specifically, we compare FFT performance on the hypercube and on two-, three- and four-dimensional torus architectures. On the hypercube we observe the somewhat surprising result that, in the limit, communication is ultimately negligible compared to computation. While the opposite is true of the torus, it is nevertheless possible to obtain comparable performance over a broad range of processors and problem sizes. As computers are built with increasing numbers of processors, torus performance can still be made comparable to the hypercube by gradually increasing the dimension of the interconnect. Any number of processors can be used to compute the FFT with efficiency that is theoretically bounded away from zero.
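
The abstract does not give the communication algorithms themselves, so the following is only a minimal sketch of one standard network property, the diameter (the largest minimum hop count between two nodes), which suggests why raising the torus dimension narrows the gap to the hypercube. The function names are hypothetical, and the sketch assumes a torus with the same number of nodes along every dimension; it does not model the paper's multiport routing or its timing results.

```python
def hypercube_diameter(p: int) -> int:
    """Diameter of a hypercube with p = 2**d nodes, which is d hops."""
    d = p.bit_length() - 1
    if p < 1 or (1 << d) != p:
        raise ValueError("p must be a power of two")
    return d


def torus_diameter(p: int, n: int) -> int:
    """Diameter of an n-dimensional torus with k nodes per side, p = k**n.

    With wraparound links, the farthest node along one dimension is
    k // 2 hops away, so the diameter is n * (k // 2).
    """
    k = round(p ** (1.0 / n))
    if k ** n != p:
        raise ValueError("p must be a perfect n-th power")
    return n * (k // 2)


if __name__ == "__main__":
    p = 4096  # illustrative processor count, not taken from the paper
    print("hypercube:", hypercube_diameter(p))
    for n in (2, 3, 4):
        print(f"{n}-D torus:", torus_diameter(p, n))
```

For 4096 processors this gives a diameter of 12 for the hypercube and 64, 24 and 16 for the two-, three- and four-dimensional torus. Diameter is only a rough proxy for FFT communication time, but the trend matches the abstract's remark about gradually increasing the dimension of the interconnect.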

Journal: Parallel Computing
Publication Date: Apr 4, 2001
Citations: 20

Similar Papers

Parallel FFT algorithms using radix 4 butterfly computation on an eight-neighbor processor array
Kuninobu Tanno ... Susumu Horiguchi
Parallel Computing | VOL. 21
01 Jan 1995

Hybrid and 4-D FFT implementations of an open-source parallel FFT package OpenFFT
Truong Vinh Truong Duy ... Taisuke Ozaki
The Journal of Supercomputing | VOL. 72
14 Dec 2015

Scalability study in parallel computing
Mark Alan Fienup
04 Dec 2014

A Memory-Constrained Scalability Metric
M.A Fienup ... S.C Kothari
01 Jan 1993
