Abstract

Restricted Boltzmann Machines (RBMs) and models derived from them have been successfully used as basic building blocks in deep artificial neural networks for automatic feature extraction and unsupervised weight initialization, but also as density estimators. Thus, their generative and discriminative capabilities, as well as their computational cost, are instrumental to a wide range of applications. Our main contribution is to look at RBMs from a topological perspective, bringing insights from network science. Firstly, we show that RBMs and Gaussian RBMs (GRBMs) are bipartite graphs which naturally have a small-world topology. Secondly, we demonstrate on both synthetic and real-world datasets that by constraining RBMs and GRBMs to a scale-free topology (while still considering local neighborhoods and the data distribution), we reduce the number of weights that need to be computed by a few orders of magnitude, at virtually no loss in generative performance. Thirdly, we show that, for a fixed number of weights, our proposed sparse models (which by design have a larger number of hidden neurons) achieve better generative capabilities than standard fully connected RBMs and GRBMs (which by design have a smaller number of hidden neurons), at no additional computational cost.

Highlights

  • Since its conception, deep learning (Bengio 2009) has been widely studied and applied, from pure academic research to large-scale industrial applications, due to its success in different real-world machine learning problems such as audio recognition (Lee et al 2009), reinforcement learning (Mnih et al 2015), transfer learning (Ammar et al 2013), and activity recognition (Mocanu et al 2015)

  • The main contribution of this paper is to look at the basic building blocks of deep learning, i.e. Restricted Boltzmann Machines (RBMs) and Gaussian RBMs (GRBMs) (Hinton and Salakhutdinov 2006), from a topological perspective, bringing insights from network science, an extension of graph theory which analyzes real-world complex networks (Strogatz 2001)

  • In the last two sets of experiments, we compare the Gaussian compleX Boltzmann Machine (GXBM)/XBM against three other methods: (1) the standard fully connected GRBM/RBM; (2) sparse GRBM/RBM models, denoted further GRBMFixProb (Fixed Probability)/RBMFixProb, in which the probability for any possible connection to exist is set to the number of weights of the counterpart GXBM/XBM model divided by the total number of possible connections for that specific configuration of hidden and visible neurons; and (3) sparse GRBM/RBM models, denoted further GRBMTrPrTr (Train Prune Train)/RBMTrPrTr, in which sparsity is obtained using the algorithm introduced in Han et al (2015) with L2 regularization, with the weight sparsity target set to the number of weights of the counterpart GXBM/XBM model
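The GRBMFixProb/RBMFixProb baseline described above can be sketched as follows: every possible visible-hidden connection is kept independently with a fixed probability chosen so that the expected number of surviving weights matches the sparse counterpart model. This is a minimal illustrative sketch (the function name and seeding are my own, not from the paper):

```python
import numpy as np

def fixed_probability_mask(n_visible, n_hidden, n_target_weights, seed=0):
    """Sample a sparse bipartite connectivity mask: each of the
    n_visible * n_hidden possible connections exists independently
    with probability p = n_target_weights / (n_visible * n_hidden),
    so the expected number of weights matches the target."""
    rng = np.random.default_rng(seed)
    p = n_target_weights / (n_visible * n_hidden)
    return rng.random((n_visible, n_hidden)) < p

# Example: keep roughly 10% of the connections of a 784x1000 RBM.
mask = fixed_probability_mask(784, 1000, 78400)
```

During training, such a mask would simply be multiplied element-wise into the weight matrix after each update, so pruned connections stay at zero.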


Summary

Introduction

Deep learning (Bengio 2009) is widely studied and applied, from pure academic research to large-scale industrial applications, due to its success in different real-world machine learning problems such as audio recognition (Lee et al 2009), reinforcement learning (Mnih et al 2015), transfer learning (Ammar et al 2013), and activity recognition (Mocanu et al 2015). Deep learning models are artificial neural networks with multiple layers of hidden neurons, which have connections only among neurons belonging to consecutive layers, but no connections within the same layer. These models are composed of basic building blocks, such as Restricted Boltzmann Machines (RBMs) (Smolensky 1987). To formalize a Boltzmann machine and its variants, three main ingredients are required: an energy function providing scalar values for a given configuration of the network, the probabilistic inference, and the learning rules required for fitting the free parameters. This bidirectionally connected network with stochastic nodes has no unit connected with itself. The model architecture was restricted by not allowing intra-layer connections between the units, as depicted in Fig. 2 (left). Since their conception, different types of Boltzmann machines have been developed and successfully applied.
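The energy function mentioned above can be made concrete for a binary RBM: with visible units v, hidden units h, weight matrix W, and biases a and b, the standard energy is E(v, h) = -aᵀv - bᵀh - vᵀWh. Note there are no v-v or h-h terms, which is exactly the bipartite restriction. A minimal sketch (variable names are illustrative):

```python
import numpy as np

def rbm_energy(v, h, W, a, b):
    """Energy of a joint configuration (v, h) of a binary RBM:
    E(v, h) = -a.v - b.h - v.W.h
    The absence of intra-layer terms reflects the restriction that
    connections exist only between the visible and hidden layers."""
    return float(-(a @ v) - (b @ h) - (v @ W @ h))

# Tiny example: 2 visible and 2 hidden units, all weights 1, zero biases,
# all units on -> E = -(sum of W) = -4.
W = np.ones((2, 2))
a, b = np.zeros(2), np.zeros(2)
v, h = np.ones(2), np.ones(2)
E = rbm_energy(v, h, W, a, b)
```

Lower energy corresponds to higher joint probability P(v, h) ∝ exp(-E(v, h)); a sparse-topology model simply zeroes out most entries of W, so only the surviving connections contribute to the sum.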

