Abstract
In this paper, we demonstrate the feasibility of building a memristor-based approximate accelerator that operates in cooperation with general-purpose x86 processors. First, an integrated full-system simulator is developed for the simultaneous simulation of an arbitrary multicrossbar architecture as an accelerator for x86 processors, achieved by coupling the cycle-accurate MARSSx86 processor simulator with the Ngspice mixed-level/mixed-signal circuit simulator. Then, a novel mixed-signal memristor-based architecture is presented for multiplying floating-point signed complex numbers. The presented multiplier is extended to accelerate convolutional neural networks and is finally tightly integrated with the pipeline of a generic x86 processor. To validate the accelerator, it is first used to multiply matrices of varying size and distribution. It is then used to accelerate tiny-dnn, an open-source C++ implementation of deep learning neural networks. The memristor-based accelerator provides more than $100\times$ speedup and energy saving for a $64\times 64$ matrix-matrix multiplication, with an accuracy of 90%. Using the accelerated tiny-dnn for MNIST database classification, more than $10\times$ speedup and energy saving are achieved, along with 95.51% pattern recognition accuracy.
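The operation underlying such accelerators is analog matrix-vector multiplication in a memristor crossbar: input voltages applied to the rows are weighted by the programmed conductances and summed as currents on the columns, following Ohm's and Kirchhoff's laws. The sketch below is a minimal numerical illustration of this idea only, not the paper's circuit; the conductance-quantization step, the noise term, and all parameter values are assumptions standing in for device non-idealities, which is where the approximate (roughly 90%-accurate) behavior comes from.

```python
import numpy as np

def crossbar_matvec(weights, v_in, levels=16, noise_std=0.01, rng=None):
    """Idealized model of a memristor-crossbar matrix-vector product.

    weights   : matrix programmed as crossbar conductances (rows x cols)
    v_in      : input voltage vector applied to the rows
    levels    : number of discrete conductance levels (assumed device limit)
    noise_std : relative read noise (assumption; stands in for non-idealities)
    """
    rng = np.random.default_rng() if rng is None else rng
    # Quantize weights to a limited number of conductance levels.
    w_max = np.abs(weights).max()
    g = np.round(weights / w_max * (levels - 1)) / (levels - 1) * w_max
    # Column currents sum the row contributions (Kirchhoff's current law),
    # perturbed here by multiplicative read noise.
    i_out = v_in @ g
    return i_out * (1 + rng.normal(0.0, noise_std, size=i_out.shape))

# Example: approximate a 64x64 matrix-matrix product one row at a time,
# mirroring the 64x64 benchmark reported in the abstract.
rng = np.random.default_rng(0)
A = rng.standard_normal((64, 64))
B = rng.standard_normal((64, 64))
C_approx = np.stack([crossbar_matvec(B, A[i], rng=rng) for i in range(64)])
rel_err = np.linalg.norm(C_approx - A @ B) / np.linalg.norm(A @ B)
print(f"relative error: {rel_err:.3f}")  # loss from quantization + noise
```

In a physical crossbar, signed weights would require differential conductance pairs and the floating-point complex multiplication described in the paper needs additional mixed-signal circuitry; this sketch abstracts all of that into a single idealized array.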