Abstract
In this paper, two fast and efficient layer-by-layer pre-training methods, based on error minimization and extending existing approaches, are proposed for initializing deep neural network (DNN) weights. DNN training often fails to converge because it encounters a large number of local minima. By properly initializing the DNN weights at the start of training, rather than using random values, many of these local minima can be avoided. The first version of the proposed method pre-trains a deep bottleneck neural network (DBNN): the DBNN is decomposed into corresponding single-hidden-layer bottleneck neural networks (BNNs), which are trained first, and the weights resulting from their training are then transferred to the DBNN. This method was used to pre-train a five-hidden-layer DBNN that extracts the non-linear principal components of face images in the Bosphorus database. Comparing a randomly initialized DBNN with one pre-trained by the layer-by-layer method shows that pre-training not only increases the convergence rate of training but also improves generalization. The method also achieves higher efficiency and faster convergence than several previous pre-training methods. The paper further presents a bidirectional version of layer-by-layer pre-training for hetero-associative DNNs, which pre-trains the DNN weights in the forward and backward directions in parallel. Applying bidirectional layer-by-layer pre-training to a classifier DNN improved both training speed and recognition rate on the Bosphorus and CK+ databases.
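To illustrate the decomposition described above, the following is a minimal sketch of greedy layer-by-layer pre-training for a symmetric deep bottleneck autoencoder: each encoder/decoder pair is first trained as a single-hidden-layer BNN to minimize reconstruction error, and the resulting weights are then assembled into the full DBNN for end-to-end fine-tuning. The framework (PyTorch), layer sizes, activation, and training settings here are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

def pretrain_layerwise(data, layer_sizes, epochs=50, lr=1e-3):
    """Greedily pre-train each encoder/decoder pair as a
    single-hidden-layer BNN minimizing reconstruction error,
    then return the trained Linear layers for assembly."""
    encoders, decoders = [], []
    x = data
    for d_in, d_out in zip(layer_sizes[:-1], layer_sizes[1:]):
        enc = nn.Linear(d_in, d_out)
        dec = nn.Linear(d_out, d_in)
        bnn = nn.Sequential(enc, nn.Tanh(), dec)   # one-hidden-layer BNN
        opt = torch.optim.Adam(bnn.parameters(), lr=lr)
        for _ in range(epochs):
            opt.zero_grad()
            loss = nn.functional.mse_loss(bnn(x), x)  # reconstruction error
            loss.backward()
            opt.step()
        encoders.append(enc)
        decoders.insert(0, dec)                    # decoders in reverse order
        with torch.no_grad():                      # codes become next BNN's input
            x = torch.tanh(enc(x))
    return encoders, decoders

# Assemble the pre-trained weights into the full DBNN for fine-tuning.
# Sizes and the random stand-in for face-image vectors are hypothetical.
layer_sizes = [4096, 512, 128, 30]                 # input -> ... -> bottleneck
data = torch.randn(256, 4096)
encs, decs = pretrain_layerwise(data, layer_sizes)
layers = []
for e in encs:
    layers += [e, nn.Tanh()]
for d in decs[:-1]:
    layers += [d, nn.Tanh()]
layers.append(decs[-1])                            # linear reconstruction output
dbnn = nn.Sequential(*layers)                      # ready for end-to-end training
```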