A Latency-Optimized Reconfigurable NoC for In-Memory Acceleration of DNNs

Sumit K Mandal,Jae-Sun Seo,Yu Cao,Umit Y Ogras,Chaitali Chakrabarti,Gokul Krishnan

doi:10.1109/jetcas.2020.3015509

Sumit K Mandal, Jae-Sun Seo + Show 4 more

Open Access

https://doi.org/10.1109/jetcas.2020.3015509

Copy DOI

Abstract

In-memory computing reduces latency and energy consumption of Deep Neural Networks (DNNs) by reducing the number of off-chip memory accesses. However, crossbar-based in-memory computing may significantly increase the volume of on-chip communication since the weights and activations are on-chip. State-of-the-art interconnect methodologies for in-memory computing deploy a bus-based network or mesh-based Network-on-Chip (NoC). Our experiments show that up to 90% of the total inference latency of a DNN hardware is spent on on-chip communication when the bus-based network is used. To reduce the communication latency, we propose a methodology to generate an NoC architecture along with a scheduling technique customized for different DNNs. We prove mathematically that the generated NoC architecture and corresponding schedules achieve the minimum possible communication latency for a given DNN. Furthermore, we generalize the proposed solution for edge computing and cloud computing. Experimental evaluations on a wide range of DNNs show that the proposed NoC architecture enables 20%–80% reduction in communication latency with respect to state-of-the-art interconnect solutions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Journal on Emerging and Selected Topics in Circuits and Systems	Publication Date: Sep 1, 2020
Citations: 49	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

A Latency-Optimized Reconfigurable NoC for In-Memory Acceleration of DNNs

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Emerging and Selected Topics in Circuits and Systems

Lead the way for us

Similar Papers

Impact of On-chip Interconnect on In-memory Acceleration of Deep Neural Networks
Gokul Krishnan ... Jae-Sun Seo
ACM Journal on Emerging Technologies in Computing Systems | VOL. 18
Gokul Krishnan, et. al.Gokul Krishnan ... Jae-Sun Seo
31 Dec 2021
ACM Journal on Emerging Technologies in Computing Systems | VOL. 18

Guest Editorial: Special Section on On-Chip Networks
L.-S Peh ... T.M Pinkston
IEEE Transactions on Parallel and Distributed Systems | VOL. 16
L.-S Peh, et. al.L.-S Peh ... T.M Pinkston
01 Feb 2005
IEEE Transactions on Parallel and Distributed Systems | VOL. 16

Interconnect-Centric Benchmarking of In-Memory Acceleration for DNNS
Gakul Krishnan ... Chaitali Chakrabarti
-
Gakul Krishnan, et. al.Gakul Krishnan ... Chaitali Chakrabarti
14 Mar 2021
14 Mar 2021

A Reliable Routing Architecture and Algorithm for Network-on-Chip
...
-
, et. al. ...
20 Nov 2015
20 Nov 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Latency-Optimized Reconfigurable NoC for In-Memory Acceleration of DNNs

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Emerging and Selected Topics in Circuits and Systems