Abstract

The generalization capability of a multi-layer perceptron (MLP) depends on the initialization of its weights. If the weights of an MLP are not initialized properly, it may fail to generalize well. In this article, we propose a weight initialization technique that improves the generalization of an MLP. It is based on a regularized stacked auto-encoder pre-training method. During pre-training, the weights between each pair of adjacent layers of the MLP, up to the penultimate layer, are trained layer-wise by an auto-encoder. The auto-encoder is trained with a weighted sum of two terms: (i) the mean squared error (MSE) and (ii) the sum of squares of the first-order derivatives of the outputs with respect to the inputs. The second term acts as a regularizer: it penalizes the training of the auto-encoder during pre-training so that it produces better initial weight values for each successive layer of the MLP. To compare the proposed initialization technique with random weight initialization, we consider ten standard classification data sets. Empirical results show that the proposed initialization technique improves the generalization of the MLP.
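
As a concrete illustration of the regularized reconstruction objective described in the abstract, the sketch below trains a single auto-encoder layer with a weighted sum of the MSE term and a penalty on the squared first-order derivatives of the outputs with respect to the inputs, using PyTorch autograd. The class and function names, the sigmoid activations, the layer sizes, and the regularization weight `lam` are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

# Illustrative sketch (not the authors' code): one auto-encoder layer trained
# with (i) MSE reconstruction error plus (ii) the sum of squares of the
# first-order derivatives of the outputs with respect to the inputs.
# Layer sizes, sigmoid activations, and `lam` are assumptions for the example.

class AutoEncoderLayer(nn.Module):
    def __init__(self, in_dim, hidden_dim):
        super().__init__()
        self.encoder = nn.Linear(in_dim, hidden_dim)
        self.decoder = nn.Linear(hidden_dim, in_dim)
        self.act = nn.Sigmoid()

    def forward(self, x):
        h = self.act(self.encoder(x))      # hidden representation
        return self.act(self.decoder(h))   # reconstruction of the input


def regularized_loss(model, x, lam=0.1):
    """Weighted sum of MSE and the derivative penalty (weight `lam` assumed)."""
    recon = model(x)
    mse = ((recon - x) ** 2).mean()
    # Jacobian of the reconstruction w.r.t. the input; summing its squared
    # entries gives the sum-of-squared first-order derivatives regularizer.
    jac = torch.autograd.functional.jacobian(model, x, create_graph=True)
    penalty = (jac ** 2).sum()
    return mse + lam * penalty


# Example: pre-train one layer on (unlabeled) inputs; the learned encoder
# weights would then initialize the corresponding layer of the MLP.
if __name__ == "__main__":
    ae = AutoEncoderLayer(in_dim=8, hidden_dim=4)
    opt = torch.optim.SGD(ae.parameters(), lr=0.01)
    data = torch.rand(32, 8)
    for _ in range(10):
        opt.zero_grad()
        loss = regularized_loss(ae, data)
        loss.backward()
        opt.step()
```

In a stacked setting, this procedure would be repeated layer by layer, feeding each layer's hidden representation as the input to the next auto-encoder, up to the penultimate layer of the MLP.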
