Abstract

Gaining knowledge from vast datasets is a key challenge in today's data-driven applications. Sparse grids provide a numerical method for both classification and regression in data mining that scales only linearly in the number of data points and is thus well suited for huge amounts of data. Due to the recursive nature of sparse grid algorithms and the random memory access patterns of their classical implementations, they pose a challenge for parallelization on modern hardware architectures such as accelerators. In this paper, we present the parallelization on several current task- and data-parallel platforms, covering multi-core CPUs with vector units, GPUs, and hybrid systems. We demonstrate that an implementation that is less efficient from an algorithmic point of view can be beneficial if it enables vectorization and a higher degree of parallelism instead. Furthermore, we analyze the suitability of parallel programming languages for the implementation. On the hardware side, we restrict ourselves to the x86 platform with SSE and AVX vector extensions and to NVIDIA's Fermi architecture for GPUs. We consider both multi-core CPU and GPU architectures independently, as well as hybrid systems with up to 12 cores and 2 Fermi GPUs. With respect to parallel programming, we examine both the open standard OpenCL and Intel Array Building Blocks, a recently introduced high-level programming approach, and comment on their ease of use. As the baseline, we use the best results obtained with classically parallelized sparse grid algorithms and their OpenMP-parallelized intrinsics counterparts (SSE and AVX instructions), reporting both single- and double-precision measurements. The huge datasets we use comprise a real-life dataset stemming from astrophysics as well as artificial ones, all of which exhibit challenging properties. In all settings, we achieve excellent results, obtaining speedups of up to 188× using single precision on a hybrid system.
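To make the vectorization trade-off mentioned above concrete, the following C++ sketch (our own illustration, not code from the paper; the hat-basis formula, data layout, and names such as evalBasis and evalAll are assumptions) evaluates a sparse grid function at many data points with a flat double loop over grid points and data points. The outer loop is independent across data points and therefore maps naturally to threads, SIMD lanes, or GPU work-items, even though a recursive traversal of the hierarchical basis would perform less arithmetic per data point.

// Illustrative sketch only: a flat, data-parallel evaluation of a sparse grid
// function at many data points. Function and variable names are hypothetical
// and are not taken from the paper's code base.
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// One d-linear hat basis function: phi_{l,i}(x) = prod_d max(1 - |2^{l_d} x_d - i_d|, 0).
double evalBasis(const std::vector<int>& l, const std::vector<int>& i,
                 const double* x, std::size_t dim) {
    double phi = 1.0;
    for (std::size_t d = 0; d < dim; ++d) {
        // std::ldexp(x, e) computes x * 2^e
        phi *= std::max(1.0 - std::fabs(std::ldexp(x[d], l[d]) - i[d]), 0.0);
    }
    return phi;
}

// Evaluate f(x_m) = sum_j alpha_j * phi_j(x_m) for all data points x_m.
// The outer loop over data points is embarrassingly parallel; the inner loop
// over all grid points is redundant compared with a recursive descent of the
// hierarchy, but it is regular and therefore easy to vectorize.
std::vector<double> evalAll(const std::vector<std::vector<int>>& levels,
                            const std::vector<std::vector<int>>& indices,
                            const std::vector<double>& alpha,
                            const std::vector<double>& data,  // row-major, numPoints * dim
                            std::size_t dim) {
    const std::size_t numPoints = data.size() / dim;
    std::vector<double> result(numPoints, 0.0);
    #pragma omp parallel for
    for (long m = 0; m < static_cast<long>(numPoints); ++m) {
        double sum = 0.0;
        for (std::size_t j = 0; j < alpha.size(); ++j) {
            sum += alpha[j] * evalBasis(levels[j], indices[j], &data[m * dim], dim);
        }
        result[m] = sum;
    }
    return result;
}

In this sketch the extra work per data point is the price paid for a streaming, branch-free inner loop; under the paper's thesis, that regularity is what lets SSE/AVX units and GPU work-groups be kept busy.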