A Case Study in Reverse Engineering GPGPUs

Ahmad Lashgar, Ebad Salehi, Amirali Baniasadi

doi:10.1145/2927964.2927968

Abstract

In recent years, GPU micro-architectures have changed dramatically, evolving into powerful many-core, deep-multithreaded platforms for parallel workloads. Although important micro-architectural modifications continue to appear in every new generation of these processors, little is known about the details of these designs. One of the key questions in understanding GPUs is how they handle outstanding memory misses. Our goal in this study is to answer this question. To this end, we develop a set of micro-benchmarks in CUDA to characterize the resources that handle outstanding memory requests. In particular, we study two NVIDIA GPGPUs (Fermi and Kepler) and estimate their capability to handle outstanding memory requests. We show that Kepler can issue nearly 32X more outstanding memory requests than Fermi. We explain this improvement by Kepler's architectural modifications to the resources that handle outstanding memory requests.

Journal: ACM SIGARCH Computer Architecture News
Publication Date: Apr 22, 2016
Citations: 5

