3D-ReG

Bing Li,Partha Pratim Pande,Joe X Qiu,Hai (Helen) Li,Krishnendu Chakrabarty,Janardhan Rao Doppa

doi:10.1145/3375699

Abstract

Deep neural network (DNN) models are being expanded to a broader range of applications. The computational capability of traditional hardware platforms cannot accommodate the growth of model complexity. Among recent technologies to accelerate DNN, resistive memory (ReRAM)-based processing-in-memory (PIM) emerged as a promising solution for DNN inference due to its high efficiency for matrix-based computation. We face two major technical challenges in extending the use of ReRAM-based accelerators for training: (1) full-precision data is essential in back-propagation; (2) the need to support both feed-forward and back-propagation aggravates the data-movement burden. We propose a heterogeneous architecture named as 3D-ReG, which leverages full-precision GPU to ensure training accuracy and low-overhead 3D integration to provide low-cost data movements. Moreover, we introduce conservative and aggressive task-mapping schemes, which partition the computation phases in different ways to balance execution efficiency and training accuracy. We evaluate 3D-ReG implemented with two 3D integration technologies, through-silicon vias (TSVs) and monolithic inter-tier vias (MIVs), and compare them with GPU-only and PIM-only counterparts. Various GPU-only platforms using two main-memory technologies (DRAM, ReRAM) and three interconnect technologies (2D, TSV, MIV) are evaluated as well. Experimental results show that 3D-ReG can achieve on average 5.64× training speedup and 3.56× higher energy efficiency compared with the GPU with DRAM as main memory, at the cost of 0.05%–3.39% accuracy drop. We define a new metric, gain-loss ratio (GLR), which quantitatively evaluates the capability of a DNN training hardware in terms of the model accuracy and hardware efficiency. The results of our comparison show that the aggressive task-mapping scheme on MIV-based 3D-ReG outperforms the other methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

3D-ReG

Abstract

Talk to us

Similar Papers

More From: ACM Journal on Emerging Technologies in Computing Systems

Lead the way for us

Journal: ACM Journal on Emerging Technologies in Computing Systems	Publication Date: Jan 30, 2020
Citations: 16

Similar Papers

Routing Complexity Minimization of Monolithic Three-Dimensional Integrated Circuits
Sheng-En David Lin ... Dae Hyun Kim
-
Sheng-En David Lin, et. al.Sheng-En David Lin ... Dae Hyun Kim
01 Mar 2019
01 Mar 2019

Quantifying the Impact of Monolithic 3D (M3D) Integration on L1 Caches
Young-Ho Gong ... Joonho Kong
IEEE Transactions on Emerging Topics in Computing | VOL. 9
Young-Ho Gong, et. al.Young-Ho Gong ... Joonho Kong
01 Apr 2021
IEEE Transactions on Emerging Topics in Computing | VOL. 9

HeM3D
Aqeeb Iqbal Arka ... Partha Pratim Pande
ACM Transactions on Design Automation of Electronic Systems | VOL. 26
Aqeeb Iqbal Arka, et. al.Aqeeb Iqbal Arka ... Partha Pratim Pande
17 Feb 2021
ACM Transactions on Design Automation of Electronic Systems | VOL. 26

3D Wafer Level Heterogeneous Integration
M Juergen Wolf ... Klaus-Dieter Lang
Electrochemical Society Meeting Abstracts | VOL. MA2014-02
M Juergen Wolf, et. al.M Juergen Wolf ... Klaus-Dieter Lang
05 Aug 2014
Electrochemical Society Meeting Abstracts | VOL. MA2014-02

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

3D-ReG

Abstract

Talk to us

Similar Papers

More From: ACM Journal on Emerging Technologies in Computing Systems