An FMM Based on Dual Tree Traversal for Many-Core Architectures

R Yokota

doi:10.1260/1748-3018.7.3.301

Abstract

The present work attempts to integrate the independent efforts in the fast N-body community to create the fastest N-body library for many-core and heterogeneous architectures. Focus is placed on low-accuracy optimizations, in response to the recent interest in using the fast multipole method (FMM) as a preconditioner for sparse linear solvers. A direct comparison with other state-of-the-art fast N-body codes demonstrates that an orders-of-magnitude increase in performance can be achieved by careful selection of the optimal algorithm and low-level optimization of the code. The current N-body solver uses an FMM with an efficient strategy for finding the list of cell-cell interactions by a dual tree traversal. A task-based threading model is used to maximize thread-level parallelism and intra-node load balancing. To extract the full potential of the SIMD units on the latest CPUs, the inner kernels are optimized using AVX instructions.
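The dual tree traversal named in the abstract can be illustrated with a small sketch. This is not the paper's implementation: it assumes a 1D binary tree, a simple multipole acceptance criterion with an opening parameter `theta`, and hypothetical names (`Cell`, `build_tree`, `well_separated`, `dual_traverse`) chosen only for illustration. It shows how a simultaneous descent of the target and source trees produces a list of far-field cell-cell pairs and a list of near-field leaf-leaf pairs.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Cell:
    lo: float                      # left edge of the cell
    hi: float                      # right edge of the cell
    bodies: List[float]            # positions of the bodies inside the cell
    children: List["Cell"] = field(default_factory=list)

    @property
    def center(self) -> float:
        return 0.5 * (self.lo + self.hi)

    @property
    def radius(self) -> float:
        return 0.5 * (self.hi - self.lo)


def build_tree(bodies: List[float], lo: float, hi: float, leaf_size: int = 2) -> Cell:
    """Recursively bisect [lo, hi] until each cell holds at most leaf_size bodies.

    Assumes distinct body positions inside [lo, hi]."""
    cell = Cell(lo, hi, bodies)
    if len(bodies) > leaf_size:
        mid = 0.5 * (lo + hi)
        left = [x for x in bodies if x < mid]
        right = [x for x in bodies if x >= mid]
        halves = ((left, lo, mid), (right, mid, hi))
        # Empty halves are dropped, so children always partition the parent's bodies.
        cell.children = [build_tree(h, a, b, leaf_size) for h, a, b in halves if h]
    return cell


def well_separated(a: Cell, b: Cell, theta: float) -> bool:
    """Assumed acceptance criterion: (R_a + R_b) / distance < theta."""
    return abs(a.center - b.center) * theta > a.radius + b.radius


Pairs = List[Tuple[Cell, Cell]]


def dual_traverse(a: Cell, b: Cell, theta: float, far: Pairs, near: Pairs) -> None:
    """Descend the target tree (a) and the source tree (b) at the same time."""
    if well_separated(a, b, theta):
        far.append((a, b))         # handled by a cell-cell (multipole-to-local) kernel
    elif not a.children and not b.children:
        near.append((a, b))        # two leaves: handled by direct summation
    elif not b.children or (a.children and a.radius >= b.radius):
        for child in a.children:   # split the larger (or only splittable) cell
            dual_traverse(child, b, theta, far, near)
    else:
        for child in b.children:
            dual_traverse(a, child, theta, far, near)


# Usage: 16 evenly spaced bodies on [0, 1]; the tree interacts with itself.
bodies = [i / 16 + 1 / 32 for i in range(16)]
root = build_tree(bodies, 0.0, 1.0)
m2l: Pairs = []
p2p: Pairs = []
dual_traverse(root, root, 0.5, m2l, p2p)
print(len(m2l), "far-field pairs,", len(p2p), "near-field pairs")
```

Because each split replaces a cell by children that partition its bodies, every target-source body pair ends up in exactly one entry of the two lists. The recursive calls inside each loop work on disjoint cell pairs, so in principle each could be spawned as a separate task, which is one way a traversal of this kind can fit the task-based threading model the abstract mentions.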

Journal: Journal of Algorithms & Computational Technology
Publication Date: Sep 1, 2013
Citations: 81
License type: cc-by-nc

Similar Papers

Improving the FMM performance using optimal group size on heterogeneous system architectures
J A López-Fernández ... M López-Portugués
The Journal of Supercomputing | VOL. 73
08 Sep 2016

Shape optimization of sound barrier using an isogeometric fast multipole boundary element method in two dimensions
Cheng Liu ... Haibo Chen
Engineering Analysis with Boundary Elements | VOL. 85
01 Nov 2017

A fast multipole hybrid boundary node method for composite materials
Qiao Wang ... Hongping Zhu
Computational Mechanics | VOL. 51
02 Aug 2012

Application of new fast multipole boundary integral equation method to crack problems in 3D
Ken-Ichi Yoshida ... Shoichi Kobayashi
Engineering Analysis with Boundary Elements | VOL. 25
01 Apr 2001
