Natural Gradient Method Research Articles

Natural gradient learning is known to be efficient in escaping plateau, which is a main cause of the slow learning speed of neural networks. The adaptive natural gradient learning method for practical implementation also has been developed, and its advantage in real-world problems has been confirmed. In this letter, we deal with the generalization performances of the natural gradient method. Since natural gradient learning makes parameters fit to training data quickly,the overfitting phenomenon may easily occur, which results in poor generalization performance. To solve the problem, we introduce the regularization term in natural gradient learning and propose an efficient optimizing method for the scale of regularization by using a generalized Akaike information criterion (network information criterion). We discuss the properties of the optimized regularization strength by NIC through theoretical analysis as well as computer simulations. We confirm the computational efficiency and generalization performance of the proposed method in real-world applications through computational experiments on benchmark problems.

Blind source separation is the problem of extracting independent signals from their mixtures without knowing the mixing coefficients nor the probability distributions of source signals and may be applied to EEG and MEG imaging of the brain. It is already known that certain algorithms work well for the extraction of independent components. The present paper is concerned with superefficiency of these based on the statistical and dynamical analysis. In a statistical estimation using t examples, the covariance of any two extracted independent signals converges to 0 of the order of 1/t. On-line dynamics shows that the covariance is of the order of /spl eta/ when the learning rate /spl eta/ is fixed to a small constant. In contrast with the above general properties, a surprising superefficiency holds in blind source separation under certain conditions where superefficiency implies that covariance decreases in the order of 1/t/sup 2/ or of /spl eta//sup 2/. The paper uses the natural gradient learning algorithm and method of estimating functions to obtain superefficient procedures for both batch estimation and on-line learning. A standardized estimating function is introduced to this end. Superefficiency does not imply that the error variances of the extracted signals decrease in the order of 1/t/sup 2/ or /spl eta//sup 2/ but implies that their covariances (and independencies) do.

Natural Gradient Method Research Articles

Related Topics

Articles published on Natural Gradient Method

Improving generalization performance of natural gradient learning using optimized regularization by NIC.

Equivariant nonstationary source separation

Adaptive natural gradient learning algorithms for various stochastic models

Improving stability in blind source separation with stochastic median gradient

Superefficiency in blind source separation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Natural Gradient Method Research Articles

Related Topics

Articles published on Natural Gradient Method

Improving generalization performance of natural gradient learning using optimized regularization by NIC.

Equivariant nonstationary source separation

Adaptive natural gradient learning algorithms for various stochastic models

Improving stability in blind source separation with stochastic median gradient

Superefficiency in blind source separation