Efficient SIMD code generation for irregular kernels

Seonggun Kim,Hwansoo Han

doi:10.1145/2370036.2145824

Abstract

Array indirection causes several challenges for compilers to utilize single instruction, multiple data (SIMD) instructions. Disjoint memory references, arbitrarily misaligned memory references, and dependence cycles in loops are main challenges to handle for SIMD compilers. Due to those challenges, existing SIMD compilers have excluded loops with array indirection from their candidate loops for SIMD vectorization. However, addressing those challenges is inevitable, since many important compute-intensive applications extensively use array indirection to reduce memory and computation requirements. In this work, we propose a method to generate efficient SIMD code for loops containing indirected memory references. We extract both inter- and intra-iteration parallelism, taking data reorganization overhead into consideration. We also optimally place data reorganization code in order to amortize the reorganization overhead through the performance gain of SIMD vectorization. Experiments on four array indirection kernels, which are extracted from real-world scientific applications, show that our proposed method effectively generates SIMD code for irregular kernels with array indirection. Compared to the existing SIMD vectorization methods, our proposed method significantly improves the performance of irregular kernels by 91%, on average.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient SIMD code generation for irregular kernels

Abstract

Talk to us

Similar Papers

More From: ACM SIGPLAN Notices

Lead the way for us

Journal: ACM SIGPLAN Notices	Publication Date: Feb 25, 2012
Citations: 15

Similar Papers

Efficient SIMD code generation for irregular kernels
Seonggun Kim ... Hwansoo Han
-
Seonggun Kim, et. al.Seonggun Kim ... Hwansoo Han
25 Feb 2012
25 Feb 2012

Design of Parallel BEM Analyses Framework for SIMD Processors
Tetsuya Hoshino ... Akihiro Ida
-
Tetsuya Hoshino, et. al.Tetsuya Hoshino ... Akihiro Ida
01 Jan 2018
01 Jan 2018

Vectorization Programming Based on HR DSP Using SIMD
Chunhu Xie ... Huachun Wu
Electronics | VOL. 12
Chunhu Xie, et. al.Chunhu Xie ... Huachun Wu
03 Jul 2023
Electronics | VOL. 12

Single instruction multiple data vectorization of non-normalized loops
Yongsheng Hou ... Rongcai Zhao
Journal of Computer Applications | VOL. 33
Yongsheng Hou, et. al.Yongsheng Hou ... Rongcai Zhao
26 Nov 2013
Journal of Computer Applications | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient SIMD code generation for irregular kernels

Abstract

Talk to us

Similar Papers

More From: ACM SIGPLAN Notices