Variational Bayes Solution of Linear Neural Networks and Its Generalization Performance

Shinichi Nakajima,Sumio Watanabe

doi:10.1162/neco.2007.19.4.1112

Abstract

It is well known that in unidentifiable models, the Bayes estimation provides much better generalization performance than the maximum likelihood (ML) estimation. However, its accurate approximation by Markov chain Monte Carlo methods requires huge computational costs. As an alternative, a tractable approximation method, called the variational Bayes (VB) approach, has recently been proposed and has been attracting attention. Its advantage over the expectation maximization (EM) algorithm, often used for realizing the ML estimation, has been experimentally shown in many applications; nevertheless, it has not yet been theoretically shown. In this letter, through analysis of the simplest unidentifiable models, we theoretically show some properties of the VB approach. We first prove that in three-layer linear neural networks, the VB approach is asymptotically equivalent to a positive-part James-Stein type shrinkage estimation. Then we theoretically clarify its free energy, generalization error, and training error. Comparing them with those of the ML estimation and the Bayes estimation, we discuss the advantage of the VB approach. We also show that unlike in the Bayes estimation, the free energy and the generalization error are less simply related with each other and that in typical cases, the VB free energy well approximates the Bayes one, while the VB generalization error significantly differs from the Bayes one.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Variational Bayes Solution of Linear Neural Networks and Its Generalization Performance

Abstract

Talk to us

Similar Papers

More From: Neural Computation

Lead the way for us

Journal: Neural Computation	Publication Date: Apr 1, 2007
Citations: 84

Similar Papers

Analytic Solution of Hierarchical Variational Bayes in Linear Inverse Problem
Shinichi Nakajima ... Sumio Watanabe
-
Shinichi Nakajima, et. al.Shinichi Nakajima ... Sumio Watanabe
01 Jan 2006
01 Jan 2006

Generalization Performance of Subspace Bayes Approach in Linear Neural Networks
S Nakajima
IEICE Transactions on Information and Systems | VOL. E89-D
S NakajimaS Nakajima
01 Mar 2006
IEICE Transactions on Information and Systems | VOL. E89-D

Bayesian estimation for geometric process with the Weibull distribution
Ilhan Usta
Communications in Statistics - Simulation and Computation | VOL. 53
Ilhan UstaIlhan Usta
23 May 2022
Communications in Statistics - Simulation and Computation | VOL. 53

Bayesian Inference for Geometric Process with Lindley Distribution and its Applications
Asuman Yılmaz ... Mahmut Kara
Fluctuation and Noise Letters | VOL. 21
Asuman Yılmaz, et. al.Asuman Yılmaz ... Mahmut Kara
05 Aug 2022
Fluctuation and Noise Letters | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Variational Bayes Solution of Linear Neural Networks and Its Generalization Performance

Abstract

Talk to us

Similar Papers

More From: Neural Computation