Restricted Boltzmann Machines and Deep Belief Networks on multi-core processors

Noel Lopes,Joao Goncalves,Bernardete Ribeiro

doi:10.1109/ijcnn.2012.6252431

Abstract

Deep learning architecture models by contrast with shallow models draw on the insights of biological inspiration which has been a challenge since the inception of the idea of simulating the brain. In particular their (many) hierarchical levels of composition track the development of parallel implementation in an attempt to become accessibly fast. When it comes to performance enhancement Graphics Processing Units (GPU) have carved their own strength in machine learning. In this paper, we present an approach that relies mainly on three kernels for implementing both the Restricted Boltzmann Machines (RBM) and Deep Belief Networks (DBN) algorithms. Instead of considering the neuron as the smallest unit of computation each thread represents the connection between two (one visible and one hidden) neurons. Although conceptually it may seem weird, the rationale behind is to think of a connection as performing a simple function that multiplies the clamped input by its weight. Thus, we maximize the GPU workload avoiding idle cores. Moreover, we placed great emphasis on the kernels to avoid uncoalesced memory accesses as well as to take advantage of the shared memory to reduce global memory accesses. Additionally, our approach uses a step adaptive learning rate procedure which accelerates convergence. The approach yields very good speedups (up to 46×) as compared with a straightforward implementation when both GPU and CPU implementations are tested on the MINST database.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Restricted Boltzmann Machines and Deep Belief Networks on multi-core processors

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Fast evaluation of Helmholtz potential on graphics processing units (GPUs)
Shaojing Li ... Vitaliy Lomakin
Journal of Computational Physics | VOL. 229
Shaojing Li, et. al.Shaojing Li ... Vitaliy Lomakin
03 Aug 2010
Journal of Computational Physics | VOL. 229

Estimating numerical error in neural network simulations on Graphics Processing Units
James P Turner ... Thomas Nowotny
BMC Neuroscience | VOL. 16
James P Turner, et. al.James P Turner ... Thomas Nowotny
01 Dec 2015
BMC Neuroscience | VOL. 16

GPU Accelerated Multilevel Lagrangian Carotid Strain Imaging.
Nirvedh H Meshram ... Tomy Varghese
IEEE transactions on ultrasonics, ferroelectrics, and frequency control | VOL. 65
Nirvedh H Meshram, et. al.Nirvedh H Meshram ... Tomy Varghese
28 May 2018
IEEE transactions on ultrasonics, ferroelectrics, and frequency control | VOL. 65

Real-world comparison of CPU and GPU implementations of SNPrank: a network analysis tool for GWAS
Nicholas A. Davis ... B. A. McKinney
Bioinformatics | VOL. 27
Nicholas A. Davis, et. al.Nicholas A. Davis ... B. A. McKinney
25 Nov 2010
Bioinformatics | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Restricted Boltzmann Machines and Deep Belief Networks on multi-core processors

Abstract

Talk to us

Similar Papers