VLSI Architectures for the Restricted Boltzmann Machine

Bo Yuan,Keshab K Parhi

doi:10.1145/3007193

Abstract

Neural network (NN) systems are widely used in many important applications ranging from computer vision to speech recognition. To date, most NN systems are processed by general processing units like CPUs or GPUs. However, as the sizes of dataset and network rapidly increase, the original software implementations suffer from long training time. To overcome this problem, specialized hardware accelerators are needed to design high-speed NN systems. This article presents an efficient hardware architecture of restricted Boltzmann machine (RBM) that is an important category of NN systems. Various optimization approaches at the hardware level are performed to improve the training speed. As-soon-as-possible and overlapped-scheduling approaches are used to reduce the latency. It is shown that, compared with the flat design, the proposed RBM architecture can achieve 50% reduction in training time. In addition, an on-the-fly computation scheme is also used to reduce the storage requirement of binary and stochastic states by several hundreds of times. Then, based on the proposed approach, a 784-2252 RBM design example is developed for MNIST handwritten digit recognition dataset. Analysis shows that the VLSI design of RBM achieves significant improvement in training speed and energy efficiency as compared to CPU/GPU-based solution.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

VLSI Architectures for the Restricted Boltzmann Machine

Abstract

Talk to us

Similar Papers

More From: ACM Journal on Emerging Technologies in Computing Systems

Lead the way for us

Journal: ACM Journal on Emerging Technologies in Computing Systems	Publication Date: May 12, 2017
Citations: 5

Similar Papers

Privacy-Preserving Deep Learning Framework Based on Restricted Boltzmann Machines and Instance Reduction Algorithms
Alya Alshammari ... Khalil El Hindi
Applied Sciences | VOL. 14
Alya Alshammari, et. al.Alya Alshammari ... Khalil El Hindi
01 Feb 2024
Applied Sciences | VOL. 14

Pipelined parallel contrastive divergence for continuous generative model learning
Bruno U Pedroni ... Sadique Sheik
-
Bruno U Pedroni, et. al.Bruno U Pedroni ... Sadique Sheik
01 May 2017
01 May 2017

Comparison of the GRNN and BP neural network for the prediction of populus (P.×euramericana cv.“74/76”) seedlings' water consumption
Wei-Dong Gao ... Yang-Cui Ning
-
Wei-Dong Gao, et. al.Wei-Dong Gao ... Yang-Cui Ning
01 Aug 2010
01 Aug 2010

Cybersecurity of multi-cloud healthcare systems: A hierarchical deep learning approach
Lav Gupta ... Raj Jain
Applied Soft Computing | VOL. 118
Lav Gupta, et. al.Lav Gupta ... Raj Jain
12 Jan 2022
Applied Soft Computing | VOL. 118

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

VLSI Architectures for the Restricted Boltzmann Machine

Abstract

Talk to us

Similar Papers

More From: ACM Journal on Emerging Technologies in Computing Systems