An adaptive performance modeling tool for GPU architectures

Sara S Baghsorkhi,William D Gropp,Sanjay J Patel,Matthieu Delahaye,Wen-Mei W Hwu

doi:10.1145/1693453.1693470

Abstract

This paper presents an analytical model to predict the performance ofgeneral-purpose applications on a GPU architecture. The model is designed to provide performance information to an auto-tuning compiler and assist it in narrowing down the search to the more promising implementations. It can also be incorporated into a tool to help programmers better assess the performance bottlenecks in their code. We analyze each GPU kernel and identify how the kernel exercises major GPU microarchitecture features. To identify the performance bottlenecks accurately, we introduce an abstract interpretation of a GPU kernel, work flow graph, based on which we estimate the execution time of a GPU kernel. We validated our performance model on the NVIDIA GPUs using CUDA (Compute Unified Device Architecture). For this purpose, we used data parallel benchmarks that stress different GPU microarchitecture events such as uncoalesced memory accesses, scratch-pad memory bank conflicts, and control flow divergence, which must be accurately modeled but represent challenges to the analytical performance models. The proposed model captures full system complexity and shows high accuracy in predicting the performance trends of different optimized kernel implementations. We also describe our approach to extracting the performance model automatically from a kernel code.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An adaptive performance modeling tool for GPU architectures

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An adaptive performance modeling tool for GPU architectures
Sara S Baghsorkhi ... Wen-Mei W Hwu
ACM SIGPLAN Notices | VOL. 45
Sara S Baghsorkhi, et. al.Sara S Baghsorkhi ... Wen-Mei W Hwu
09 Jan 2010
ACM SIGPLAN Notices | VOL. 45

Bounding the effect of partition camping in GPU kernels
Ashwin M Aji ... Mayank Daga
-
Ashwin M Aji, et. al.Ashwin M Aji ... Mayank Daga
03 May 2011
03 May 2011

The Realm of Graphical Processing Unit (GPU) Computing
Vivek K Pallipuram ... Jinzhu Gao
-
Vivek K Pallipuram, et. al.Vivek K Pallipuram ... Jinzhu Gao
01 Jan 2018
01 Jan 2018

An Insightful Program Performance Tuning Chain for GPU Computing
Haipeng Jia ... Yunquan Zhang
-
Haipeng Jia, et. al.Haipeng Jia ... Yunquan Zhang
01 Jan 2012
01 Jan 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An adaptive performance modeling tool for GPU architectures

Abstract

Talk to us

Similar Papers