NC-Net: Efficient Neuromorphic Computing Using Aggregated Subnets on a Crossbar-Based Architecture With Nonvolatile Memory

Tao Luo, Yingnan Cui, Rick Siow Mong Goh, Xuan Wang, Liwei Yang, Weng-Fai Wong, Chuping Qu, Huaipeng Zhang

doi:10.1109/tcad.2021.3120068

Abstract

Neuromorphic computing chips built from crossbar arrays of emerging nonvolatile memory (NVM) have the potential to achieve both high energy efficiency and high throughput as low-power implementations of convolutional neural network (CNN) inference engines. However, such hardware has design constraints, such as limited fan-in/fan-out and resource-inefficient mapping, that make designing and deploying CNNs on it challenging. As a result, the user has to design the CNN model with intricate knowledge of the hardware architecture, and may be unable to fit the model in the hardware at all when the CNN takes high-resolution image input. In this article, we propose the use of aggregated subnets, NC-net, a constrained form of the traditional layer structure, to solve these issues. With our method, we put forward an energy-efficient architecture that needs neither buffers nor analogue-to-digital and digital-to-analogue converters (ADC/DAC), together with a scalable end-to-end solution that automatically satisfies the hardware constraints of crossbar architectures while optimizing resource usage. In our solution, the exploration and deployment of a CNN for neuromorphic crossbar hardware start with a design front end based on TensorFlow. Our automated design flow maps the NC-net network from TensorFlow to the crossbar architecture. We tested our designs on both a simulator and a field-programmable gate array (FPGA) emulator with various benchmarks. In addition to general benchmarks, including MNIST, SVHN, CIFAR-10, and CIFAR-100, we tested our system on a real-world application: human detection with high-resolution (224 × 224) images as the input. Our system achieves state-of-the-art accuracy for these benchmarks on crossbar-based neuromorphic hardware, with more than 90% accuracy on the human detection task. It also yields up to a 4.25× improvement in the efficiency of spiking core usage compared to TrueNorth.
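
The fan-in/fan-out constraint mentioned in the abstract can be made concrete with a little arithmetic. The sketch below is illustrative only and is not the NC-net mapping itself: it assumes a 256 × 256 crossbar (the core size used by TrueNorth; the hardware evaluated in the paper may differ), and the function names are hypothetical. It counts how many crossbars a naive tiling of a fully connected layer would occupy and checks whether a convolution kernel's receptive field fits within one crossbar's fan-in.

```python
import math

# Assumed crossbar dimensions (inputs x outputs). 256 x 256 matches a
# TrueNorth core; this is an assumption, not a figure from the abstract.
FAN_IN = 256
FAN_OUT = 256


def crossbars_for_dense_layer(n_inputs, n_outputs, fan_in=FAN_IN, fan_out=FAN_OUT):
    """Number of crossbars occupied when an n_inputs x n_outputs weight
    matrix is naively cut into fan_in x fan_out tiles."""
    row_tiles = math.ceil(n_inputs / fan_in)
    col_tiles = math.ceil(n_outputs / fan_out)
    return row_tiles * col_tiles


def kernel_fits_one_crossbar(kernel_h, kernel_w, in_channels, fan_in=FAN_IN):
    """True if one output neuron's receptive field (kernel_h * kernel_w *
    in_channels inputs) stays within a single crossbar's fan-in."""
    return kernel_h * kernel_w * in_channels <= fan_in


if __name__ == "__main__":
    # A single-channel 224 x 224 image flattened into one dense layer
    # with 256 outputs.
    print(crossbars_for_dense_layer(224 * 224, 256))
    # A 3 x 3 kernel over 16 channels versus over 64 channels.
    print(kernel_fits_one_crossbar(3, 3, 16))
    print(kernel_fits_one_crossbar(3, 3, 64))
```

Under these assumptions, a layer whose neurons each read more inputs than one crossbar offers has to be split across several crossbars and its partial results recombined, which is the kind of overhead a network structure constrained to the hardware limits is meant to avoid.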

Journal: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Publication Date: Sep 1, 2022
Citations: 9

Similar Papers

VLSI design and implementation of High-performance Binary-weighted convolutional artificial neural networks for embedded vision based Internet of Things (IoT)
Charles Rajesh Kumar J ... M.A Majid
Procedia Computer Science | VOL. 163
01 Jan 2019

Aspects of programming for implementation of convolutional neural networks on multisystem HPC architectures
Sunil Pandey ... Shrish Verma
Journal of Physics: Conference Series | VOL. 2062
01 Nov 2021

Design and Implementation of Configurable Convolutional Neural Network on FPGA
Huynh Vinh Phu ... Nguyen Van Hieu
01 Dec 2019

Real-Time Inference With 2D Convolutional Neural Networks on Field Programmable Gate Arrays for High-Rate Particle Imaging Detectors
Yeon-Jae Jwa ... Luca Carloni
Frontiers in Artificial Intelligence | VOL. 5
18 May 2022
