Abstract

Applications of neural networks have gained significant importance in embedded mobile devices and Internet of Things (IoT) nodes. In particular, convolutional neural networks have emerged as one of the most powerful techniques in computer vision, speech recognition, and AI applications that can improve the mobile user experience. However, satisfying all power and performance requirements of such low-power devices is a significant challenge. Recent work has shown that binarizing a neural network can significantly reduce its memory footprint at the cost of a minor loss in accuracy. This paper proposes MB-CNN, a memristive accelerator for binary convolutional neural networks that performs XNOR convolution in situ within novel 2R memristive data blocks to improve the power, performance, and memory requirements of embedded mobile devices. The proposed accelerator achieves at least 13.26×, 5.91×, and 3.18× improvements in system energy efficiency (measured as energy × delay) over state-of-the-art software, GPU, and PIM architectures, respectively. The proposed system architecture, which integrates a CPU, a GPU, and MB-CNN, outperforms every other configuration in terms of both system energy and execution time.

Highlights

  • Sensor equipped Internet of Things (IoT) devices are expected to impact the future of consumer electronics

  • All of the relevant parameters of a trained binary convolutional neural network (BCNN) model are first written into the Resistive RAM (RRAM) arrays, which can later be reused for many inference tasks

  • The true and complement forms of every filter element are stored in a cell as b and b̄, respectively, where logic 1 is represented by the low resistance state (LRS) of the RRAM cell, and the high resistance state (HRS) is used for 0
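The highlights above describe storing each filter bit in both true and complement form so that applying the input bit selects which resistor is sensed, yielding an XNOR without any explicit logic gate. A minimal behavioral sketch of that idea (illustrative only, not the paper's circuit model):

```python
# Behavioral model of a 2R memristive cell: the filter bit b is stored in
# true and complement form (b, b_bar). The input bit x selects which of the
# two resistors is read, so the sensed value equals XNOR(x, b).

def two_r_cell_read(x: int, b: int) -> int:
    """Return the bit sensed from a 2R cell storing (b, b_bar) when driven by x."""
    b_bar = 1 - b
    return b if x == 1 else b_bar  # x = 1 reads b; x = 0 reads b_bar

# XNOR truth table check: the cell outputs 1 exactly when x == b.
for x in (0, 1):
    for b in (0, 1):
        assert two_r_cell_read(x, b) == (1 if x == b else 0)
```

In the actual hardware, logic 1 corresponds to the low resistance state (LRS) and logic 0 to the high resistance state (HRS), as stated above; the Python function only models the selection behavior.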


Summary

Introduction

Sensor-equipped Internet of Things (IoT) devices are expected to impact the future of consumer electronics. The increased demand for computation, memory, and energy creates serious challenges for the applicability of neural networks in IoT mobile devices. Recent studies have shown that the memory requirements of neural networks can be reduced by applying various compression and quantization techniques [7,8]. Binary neural networks [9] and XNOR-Net [10] replace power-consuming floating-point multiplications with bitwise XNOR operations. One-bit quantization helps to achieve significant performance improvements for state-of-the-art neural networks. Because of RRAM's high density and low read latency, all the parameters of a large neural network can be stored without impacting the overall area or performance of the device. Integrating the MB-CNN accelerator with a single-core CPU provides a 4.14× improvement in performance and a 4× improvement in energy over a software-based single-core CPU solution. Compared with GPU-based and PIM-like accelerators, the proposed accelerator achieves at least 5.91× and 3.18× improvements in the system energy-delay product, respectively.
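The XNOR-Net idea referenced above can be sketched in a few lines: with weights and activations quantized to {-1, +1} (encoded as bits 1 and 0), a dot product reduces to a bitwise XNOR followed by a bit count. The function and encoding below are illustrative assumptions, not the paper's implementation:

```python
# Sketch of a binary dot product via XNOR + popcount, as used in binary
# neural networks. Each {-1, +1} vector of length n is packed into the low
# n bits of an integer, with bit value 1 encoding +1 and 0 encoding -1.

def binary_dot(x_bits: int, w_bits: int, n: int) -> int:
    """Dot product of two packed n-element {-1, +1} vectors."""
    mask = (1 << n) - 1
    xnor = ~(x_bits ^ w_bits) & mask      # 1 wherever the two bits agree
    matches = bin(xnor).count("1")        # popcount of agreements
    return 2 * matches - n                # agreements minus disagreements

# Example: x = [+1, -1, +1, +1] -> 0b1011, w = [+1, +1, -1, +1] -> 0b1101.
# Two positions agree and two disagree, so the dot product is 0.
print(binary_dot(0b1011, 0b1101, 4))
```

Replacing each multiply-accumulate with this XNOR-and-count step is what makes in-memory binary convolution attractive: the expensive floating-point arithmetic disappears entirely.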

Convolutional Neural Networks
Binary CNN
Parameter Scaling
XNOR Convolution
Evolution in BCNN Algorithm
IoT Applications
Memristive Crosspoint Arrays
Memristive Binary Convolution
System Overview
Memristive XNOR Operation
Analog Bit-Count Operation
Hierarchical Bit-Counting
The MB-CNN Architecture
On-Chip Control
Bank Organization
Full Adder
Array Structure
Data Organization
Experimental Setup
Neural Network Model
Architecture
Hardware–Software Integration
Design Space Exploration
Input Datasets
Evaluations
Performance
Energy
Comparison with Other Neural Network Accelerators
Neural Network Accelerators
CNN and DNN Hardware Accelerators
Compressing and Quantizing Network Parameters
Processing in Memory
Mobile IoT Applications
Findings
Conclusions