Faster set intersection with SIMD instructions by reducing branch mispredictions

Hiroshi Inoue, Kenjiro Taura, Moriyoshi Ohara

doi:10.14778/2735508.2735518

Open Access

https://doi.org/10.14778/2735508.2735518

Abstract

Set intersection is one of the most important operations for many applications, such as Web search engines and database management systems. This paper describes our new algorithm for efficiently finding set intersections of sorted arrays on modern processors with SIMD instructions and high branch misprediction penalties. The algorithm exploits SIMD instructions efficiently and can drastically reduce branch mispredictions. It extends a merge-based algorithm by reading multiple elements, instead of just one element, from each of the two input arrays and comparing all of the pairs of elements from the two arrays to find the elements with the same values. The key insight behind our improvement is that we can reduce the number of costly hard-to-predict conditional branches by advancing a pointer by more than one element at a time. Although this approach increases the total number of comparisons, we can execute these comparisons more efficiently using SIMD instructions and gain the benefit of the reduced branch misprediction overhead. Because the algorithm is simple and requires no preprocessing to generate additional data structures, it is well suited to replacing existing standard library functions, such as std::set_intersection in C++, and can thus accelerate many applications. We implemented our algorithm on Xeon and POWER7+. The experimental results show that our algorithm outperforms the std::set_intersection implementation delivered with gcc by up to 5.2x using SIMD instructions and by up to 2.1x even without using SIMD instructions for 32-bit and 64-bit integer datasets. Our SIMD algorithm also outperformed an existing algorithm that can leverage SIMD instructions.
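The block-wise idea in the abstract can be illustrated with a minimal scalar sketch in C++. This is not the authors' implementation: the function name `block_intersect`, the fixed block size of two, and the use of `std::vector` are assumptions made for illustration, and the inputs are assumed to be sorted in ascending order with no duplicate values. Each iteration reads two elements from each array, compares all four pairs, and then advances one or both cursors by two elements, so the hard-to-predict "which side advances" decision is made once per block rather than once per element.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <iterator>
#include <vector>

// Hypothetical helper (not from the paper): intersects two sorted,
// duplicate-free arrays using blocks of two elements per side.
std::vector<std::uint32_t> block_intersect(const std::vector<std::uint32_t>& a,
                                           const std::vector<std::uint32_t>& b) {
    // Debug-only precondition check: both inputs must be sorted.
    assert(std::is_sorted(a.begin(), a.end()));
    assert(std::is_sorted(b.begin(), b.end()));

    std::vector<std::uint32_t> out;
    std::size_t i = 0, j = 0;

    // Main loop: runs while both arrays still hold a full block of two.
    while (i + 1 < a.size() && j + 1 < b.size()) {
        const std::uint32_t a0 = a[i], a1 = a[i + 1];
        const std::uint32_t b0 = b[j], b1 = b[j + 1];

        // All-pairs comparison (2 x 2 = 4 equality tests). Matches are
        // usually rare, so these branches tend to be well predicted.
        if (a0 == b0 || a0 == b1) out.push_back(a0);
        if (a1 == b0 || a1 == b1) out.push_back(a1);

        // Advance by a whole block. The block whose last element is
        // smaller cannot match anything later in the other array.
        // Written as arithmetic so no branch is needed for the advance.
        const std::size_t adv_a = (a1 <= b1) ? 2 : 0;
        const std::size_t adv_b = (b1 <= a1) ? 2 : 0;
        i += adv_a;
        j += adv_b;
    }

    // Tail: fewer than two elements remain on one side, so finish with
    // the standard element-at-a-time merge.
    std::set_intersection(a.begin() + static_cast<std::ptrdiff_t>(i), a.end(),
                          b.begin() + static_cast<std::ptrdiff_t>(j), b.end(),
                          std::back_inserter(out));
    return out;
}
```

The sketch performs more comparisons per element than a plain merge, which is the trade-off the abstract describes; a SIMD variant would replace the four scalar equality tests with vector comparisons over wider blocks.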

Journal: Proceedings of the VLDB Endowment	Publication Date: Nov 1, 2014
Citations: 58	License type: cc-by
