Abstract

Multi-chip Graphics Processing Unit (GPU) systems are critical for scaling performance beyond a single GPU chip for a wide variety of important emerging applications. A key challenge for multi-chip GPUs, though, is overcoming the bandwidth gap between inter-chip and intra-chip communication. Accesses to shared data, i.e., data accessed by multiple chips, pose a major performance challenge as they incur remote memory accesses that can congest the inter-chip links and degrade overall system performance. This article characterizes the shared dataset in multi-chip GPUs in terms of (1) truly versus falsely shared data, (2) how the shared dataset scales with input size, (3) along which dimensions the shared dataset scales, and (4) how sensitive the shared dataset is to the input’s characteristics, i.e., node degree and connectivity in graph workloads. We observe significant variety in scaling behavior across workloads: some workloads feature a shared dataset that scales linearly with input size, whereas others feature sublinear scaling (following a \(\sqrt{2}\) or \(\sqrt[3]{2}\) relationship). We further demonstrate how the shared dataset affects the optimum last-level cache organization (memory-side versus SM-side) in multi-chip GPUs, as well as the optimum memory page allocation and thread scheduling policy. Sensitivity analyses demonstrate that the insights hold across a broad design space.
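One way to make the sublinear cases concrete (a sketch only; the power-law form and the symbols \(S\) for shared-dataset size and \(n\) for input size are assumed here for illustration and are not notation from the article):

\[
S(n) \propto n^{1/2} \;\Rightarrow\; \frac{S(2n)}{S(n)} = \sqrt{2},
\qquad
S(n) \propto n^{1/3} \;\Rightarrow\; \frac{S(2n)}{S(n)} = \sqrt[3]{2},
\]

i.e., under these assumed exponents, doubling the input size grows the shared dataset by a factor of \(\sqrt{2}\) or \(\sqrt[3]{2}\), respectively.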
