Massively Parallel Server Processors

Varun Agrawal, Yuxuan Shui, Mina Abbasi Dinani, Nima Honarmand, Michael Ferdman

doi:10.1109/lca.2019.2911287

Abstract

Modern data centers enjoy massive degrees of request-level parallelism with significant cross-request similarity. Although similar requests follow similar instruction sequences, conventional processors service them individually and do not take full advantage of cross-request similarity. Single-Instruction Multiple-Thread (SIMT) architectures can leverage this similarity; however, existing SIMT processors (chief among them, GPUs) are ill-suited for server applications, as they are specifically designed to maximize throughput at the expense of latency, preventing them from meeting server QoS requirements. We advocate a new approach to SIMT server processors, namely Massively Parallel Server Processors (MPSPs), which we outline in this paper. To begin to understand their architectural needs, we measure the degree of control-flow and memory-access divergence encountered when running unmodified server applications on MPSP-style processors. Our preliminary results indicate that a software scheduler that bundles together similar requests can minimize control-flow divergence, making SIMT execution of unmodified server code feasible. Moreover, we find that memory-access divergence, although significant in raw numbers, can be tackled with changes in stack and heap layouts. Overall, our results encourage further consideration of MPSPs as a promising architecture for server processors.
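The bundling idea in the abstract can be illustrated with a small sketch. This is not the authors' scheduler or their measurement methodology: the function names (`bundle_by_key`, `lockstep_fraction`), the request representation, and the toy basic-block traces are all hypothetical. The divergence metric is a deliberately naive position-by-position comparison that ignores the reconvergence a real SIMT pipeline would perform.

```python
from collections import defaultdict
from typing import Callable, Hashable, List, Sequence, Tuple

# Hypothetical request representation: (request type, basic-block trace).
Request = Tuple[str, List[str]]


def bundle_by_key(requests: Sequence[Request],
                  key: Callable[[Request], Hashable],
                  width: int) -> List[List[Request]]:
    """Group requests that share a similarity key into bundles of at most `width`."""
    groups = defaultdict(list)
    for req in requests:
        groups[key(req)].append(req)
    bundles = []
    for members in groups.values():
        for i in range(0, len(members), width):
            bundles.append(members[i:i + width])
    return bundles


def lockstep_fraction(traces: Sequence[Sequence[str]]) -> float:
    """Fraction of steps at which all traces sit in the same basic block.

    Naive assumption: traces are compared index by index over the longest
    trace, and a trace that has already finished counts as diverged.
    """
    longest = max((len(t) for t in traces), default=0)
    if longest == 0:
        return 1.0
    agree = 0
    for step in range(longest):
        blocks = {t[step] if step < len(t) else None for t in traces}
        if len(blocks) == 1:
            agree += 1
    return agree / longest


# Toy workload: two request types interleaved in arrival order.
requests: List[Request] = [
    ("GET", ["a", "b", "c", "d"]),
    ("POST", ["a", "x", "y", "d"]),
    ("GET", ["a", "b", "c", "d"]),
    ("POST", ["a", "x", "z", "d"]),
]

# Baseline: pair requests in arrival order, mixing types within a bundle.
arrival_pairs = [requests[0:2], requests[2:4]]
arrival_scores = [lockstep_fraction([trace for _, trace in b]) for b in arrival_pairs]

# Bundled: pair requests of the same type.
bundles = bundle_by_key(requests, key=lambda r: r[0], width=2)
bundled_scores = [lockstep_fraction([trace for _, trace in b]) for b in bundles]
```

On this toy input, same-type bundles stay in lockstep for a larger share of steps than arrival-order pairs. That only shows the direction of the effect the abstract describes and says nothing about its magnitude on real server workloads.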

Journal: IEEE Computer Architecture Letters
Publication Date: Jan 1, 2019
Citations: 10

Similar Papers

The dual-path execution model for efficient GPU control flow
Minsoo Rhu ... M Erez
01 Feb 2013

Unleashing the power of GPU for physically-based rendering via dynamic ray shuffling
Yashuai Lü ... Libo Huang
14 Oct 2017

DARM: Control-Flow Melding for SIMT Thread Divergence Reduction
Charitha Saumya ... Kirshanthan Sundararajah
02 Apr 2022

TwinKernels: An execution model to improve GPU hardware scheduling at compile time
Xiang Gong ... Zhongliang Chen
01 Feb 2017
