Fast Parallel Stream Compaction for IA-Based Multi/many-core Processors

Qiao Sun,Fangfang Liu,Leisheng Li,Changmao Wu,Chao Yang

doi:10.1109/ccgrid.2016.112

Abstract

Stream compaction, frequently found in a large variety of applications, serves as a general primitive that reduces an input stream to a subset containing only the wanted elements so that the follow-on computation can be done efficiently. In this paper, we propose a fast parallel stream compaction for IA-based multi-/many-core processors. Unlike the previously studied algorithms that depend heavily on a black-box parallel scan, we open the black-box in the proposed algorithm and manually tailor it so that both the workload and the memory footprint is significantly reduced. By further eliminating the conditional statements and applying automatic code generation/optimization for performance-critical kernels, the proposed parallel stream compaction achieves high performance in different cases and for various data types across different IA-based multi/manycore platforms. Experimental results on three typical IA-based processors, including a quad-core Core-i7 CPU, a dual-socket 8- core Xeon CPU, and a 61-core Xeon Phi accelerator show that the proposed implementation outperforms the referenced parallel counterpart in the state-of-art library Thrust. On top of the above, we apply it in the random forest based data classifier to show its potential to boost the performance of real-world applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fast Parallel Stream Compaction for IA-Based Multi/many-core Processors

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

InK-Compact: In-Kernel Stream Compaction and Its Application to Multi-Kernel Data Visualization on General-Purpose GPUs
D M Hughes ... M W Jones
Computer Graphics Forum | VOL. 32
D M Hughes, et. al.D M Hughes ... M W Jones
12 Apr 2013
Computer Graphics Forum | VOL. 32

NOC characteristics of cloud applications
Pejman Lotfi-Kamran ... Mehdi Modarressi
-
Pejman Lotfi-Kamran, et. al.Pejman Lotfi-Kamran ... Mehdi Modarressi
01 Dec 2017
01 Dec 2017

On the Parallelization of Stream Compaction on a Low-Cost SDC Cluster
Gregorio Bernabé ... Manuel E Acacio
Scientific Programming | VOL. 2018
Gregorio Bernabé, et. al.Gregorio Bernabé ... Manuel E Acacio
23 Aug 2018
Scientific Programming | VOL. 2018

Foreword to the special issue of the workshop on high performance computing systems (XVIII Simpósio em Sistemas Computacionais de Alto Desempenho, WSCAD 2017)
César A F De Rose ... Márcio Castro
Concurrency and Computation: Practice and Experience | VOL. 31
César A F De Rose, et. al.César A F De Rose ... Márcio Castro
07 May 2019
Concurrency and Computation: Practice and Experience | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fast Parallel Stream Compaction for IA-Based Multi/many-core Processors

Abstract

Talk to us

Similar Papers