A GPU implementation of inclusion-based points-to analysis

Mario Mendez-Lojo,Keshav Pingali,Martin Burtscher

doi:10.1145/2370036.2145831

Abstract

Graphics Processing Units (GPUs) have emerged as powerful accelerators for many regular algorithms that operate on dense arrays and matrices. In contrast, we know relatively little about using GPUs to accelerate highly irregular algorithms that operate on pointer-based data structures such as graphs. For the most part, research has focused on GPU implementations of graph analysis algorithms that do not modify the structure of the graph, such as algorithms for breadth-first search and strongly-connected components. In this paper, we describe a high-performance GPU implementation of an important graph algorithm used in compilers such as gcc and LLVM: Andersen-style inclusion-based points-to analysis. This algorithm is challenging to parallelize effectively on GPUs because it makes extensive modifications to the structure of the underlying graph and performs relatively little computation. In spite of this, our program, when executed on a 14 Streaming Multiprocessor GPU, achieves an average speedup of 7x compared to a sequential CPU implementation and outperforms a parallel implementation of the same algorithm running on 16 CPU cores. Our implementation provides general insights into how to produce high-performance GPU implementations of graph algorithms, and it highlights key differences between optimizing parallel programs for multicore CPUs and for GPUs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A GPU implementation of inclusion-based points-to analysis

Abstract

Talk to us

Similar Papers

More From: ACM SIGPLAN Notices

Lead the way for us

Journal: ACM SIGPLAN Notices	Publication Date: Feb 25, 2012
Citations: 17

Similar Papers

A GPU implementation of inclusion-based points-to analysis
Mario Mendez-Lojo ... Martin Burtscher
-
Mario Mendez-Lojo, et. al.Mario Mendez-Lojo ... Martin Burtscher
25 Feb 2012
25 Feb 2012

Performance Improvement in Large Graph Algorithms on GPU using CUDA: an Overview
Swapnil D Joshi ... V S Inamdar
International Journal of Computer Applications | VOL. 10
Swapnil D Joshi, et. al.Swapnil D Joshi ... V S Inamdar
10 Nov 2010
International Journal of Computer Applications | VOL. 10

A novel approach toward parallel implementation of BFS algorithm using graphic processor unit
Fahmid Al Farid ... Shohag Barman
-
Fahmid Al Farid, et. al.Fahmid Al Farid ... Shohag Barman
01 May 2015
01 May 2015

Analysis of A* Algorithm Optimization and Breadth First Search in the Water Teapot Game
Bonifacius Indriyono ... Widyatmoko
Inform : Jurnal Ilmiah Bidang Teknologi Informasi dan Komunikasi | VOL. 7
Bonifacius Indriyono, et. al.Bonifacius Indriyono ... Widyatmoko
27 Jul 2022
Inform : Jurnal Ilmiah Bidang Teknologi Informasi dan Komunikasi | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A GPU implementation of inclusion-based points-to analysis

Abstract

Talk to us

Similar Papers

More From: ACM SIGPLAN Notices