Abstract
Package-level integration using multi-chip-modules (MCMs) is a promising approach for building large-scale systems. Compared to a large monolithic die, an MCM combines many smaller chiplets into a larger system, substantially reducing fabrication and design costs. Current MCMs typically contain only a handful of large, coarse-grained chiplets due to the high area, performance, and energy overheads associated with inter-chiplet communication. This work investigates and quantifies the costs and benefits of using MCMs with fine-grained chiplets for deep learning inference, an application domain with large compute and on-chip storage requirements. To evaluate the approach, we architected, implemented, fabricated, and tested Simba, a 36-chiplet prototype MCM system for deep-learning inference. Each chiplet achieves 4 TOPS peak performance, and the 36-chiplet MCM package achieves up to 128 TOPS and up to 6.1 TOPS/W. The MCM is configurable to support a flexible mapping of DNN layers to the distributed compute and storage units. To mitigate inter-chiplet communication overheads, we introduce three tiling optimizations that improve data locality. These optimizations achieve up to 16% speedup compared to the baseline layer mapping. Our evaluation shows that Simba can process 1988 images/s running ResNet-50 with a batch size of one, delivering an inference latency of 0.50 ms.
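To illustrate the kind of layer-to-chiplet mapping the abstract refers to, the sketch below shows one simple way a single layer could be tiled across a 6x6 chiplet grid: output channels partitioned across chiplet columns, input channels across chiplet rows, with partial sums reduced across chiplets. This is a minimal, hypothetical example for intuition only, not Simba's actual mapping or tiling optimizations; all names (CHIPLET_ROWS, run_layer_on_mcm, etc.) are assumptions.

```python
# Minimal sketch (assumed mapping, not the paper's implementation):
#   - output channels (K) are partitioned across chiplet columns,
#   - input channels (C) are partitioned across chiplet rows,
#   - partial sums along a column are reduced into the final output.

import numpy as np

CHIPLET_ROWS, CHIPLET_COLS = 6, 6  # 36-chiplet package


def run_layer_on_mcm(x, w):
    """x: input activations, shape (C,); w: layer weights, shape (K, C)."""
    K, C = w.shape
    # Each chiplet (row, col) holds one (K/COLS, C/ROWS) weight block.
    k_tiles = np.array_split(np.arange(K), CHIPLET_COLS)
    c_tiles = np.array_split(np.arange(C), CHIPLET_ROWS)

    y = np.zeros(K)
    for k_idx in k_tiles:                 # chiplet columns: output-channel slices
        partial = np.zeros(len(k_idx))
        for c_idx in c_tiles:             # chiplet rows: input-channel slices
            # Local partial sum computed on one chiplet over its weight block.
            partial += w[np.ix_(k_idx, c_idx)] @ x[c_idx]
        # Cross-chiplet reduction of partial sums down the column.
        y[k_idx] = partial
    return y


if __name__ == "__main__":
    C, K = 256, 128
    x = np.random.randn(C)
    w = np.random.randn(K, C)
    # The distributed mapping must match the monolithic computation.
    assert np.allclose(run_layer_on_mcm(x, w), w @ x)
```

Under this kind of partitioning, the choice of which dimension maps to rows versus columns changes how much activation and partial-sum traffic crosses chiplet boundaries, which is the locality trade-off the paper's tiling optimizations target.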