Abstract

Layer-by-layer training is an alternative to end-to-end backpropagation for training deep convolutional neural networks. On specific architectures, layer-by-layer training can yield very competitive results: on the ImageNet database (www.imagenet.org), layer-by-layer trained networks can perform comparably to many modern end-to-end trained networks. This article compares the performance gap between the two training procedures across a wide range of network architectures and further analyzes the potential limitations of layer-by-layer training. The results show that layer-by-layer learning quickly saturates after a certain critical depth due to overfitting of the early layers. Several approaches that have been used to address this problem are discussed, along with a methodology for improving layer-by-layer learning across various neural network architectures. Fundamentally, this research highlights the need to open up the black box that modern deep neural networks represent and to examine the interactions between intermediate hidden layers through the lens of layer-by-layer learning.
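
To make the training procedure concrete, below is a minimal, hypothetical sketch of greedy layer-by-layer training in PyTorch. The block structure, auxiliary classifier heads, optimizer settings, and dummy data loader are illustrative assumptions, not the architecture or protocol evaluated in the article: each convolutional block is trained in isolation against its own local auxiliary classifier, then frozen before the next block is trained on its outputs.

```python
# Minimal sketch of greedy layer-by-layer training (hypothetical setup;
# block sizes, auxiliary heads, and the data loader are illustrative,
# not the article's exact architecture or protocol).
import torch
import torch.nn as nn

def make_block(in_ch, out_ch):
    # One convolutional "layer" to be trained in isolation.
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
        nn.MaxPool2d(2),
    )

blocks = [make_block(3, 32), make_block(32, 64), make_block(64, 128)]
num_classes = 10

def aux_head(out_ch):
    # Auxiliary classifier attached to the current block's output; it
    # supplies the local supervised signal in place of end-to-end backprop.
    return nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                         nn.Linear(out_ch, num_classes))

# Dummy batches standing in for a real loader (e.g. ImageNet).
loader = [(torch.randn(8, 3, 32, 32), torch.randint(0, num_classes, (8,)))
          for _ in range(4)]

criterion = nn.CrossEntropyLoss()
frozen = nn.Sequential()  # already-trained blocks, kept fixed

for depth, (block, width) in enumerate(zip(blocks, [32, 64, 128])):
    head = aux_head(width)
    # Only the current block and its auxiliary head receive gradients.
    opt = torch.optim.SGD(list(block.parameters()) + list(head.parameters()),
                          lr=0.1, momentum=0.9)
    for x, y in loader:  # one illustrative epoch per block
        with torch.no_grad():
            feats = frozen(x)          # earlier layers stay fixed
        loss = criterion(head(block(feats)), y)
        opt.zero_grad()
        loss.backward()                # gradients stop at this block
        opt.step()
    # Freeze the trained block and append it to the fixed trunk.
    block.eval()
    for p in block.parameters():
        p.requires_grad_(False)
    frozen.append(block)
    print(f"trained block {depth}, last loss {loss.item():.3f}")
```

The key contrast with end-to-end training is that gradients never cross block boundaries: once a block joins the frozen trunk its parameters are fixed, which is also where the early-layer overfitting described above can lock in.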
