A simple theory for training response of deep neural networks

Kenichi Nakazato

doi:10.1088/1402-4896/ad49dc

Abstract

Deep neural networks give us a powerful method to model the training dataset’s relationship between input and output. We can regard that as a complex adaptive system consisting of many artificial neurons that work as an adaptive memory as a whole. The network’s behavior is training dynamics with a feedback loop from the evaluation of the loss function. We already know the training response can be constant or shows power law-like aging in some ideal situations. However, we still have gaps between those findings and other complex phenomena, like network fragility. To fill the gap, we introduce a very simple network and analyze it. We show the training response consists of some different factors based on training stages, activation functions, or training methods. In addition, we show feature space reduction as an effect of stochastic training dynamics, which can result in network fragility. Finally, we discuss some complex phenomena of deep networks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A simple theory for training response of deep neural networks

Abstract

Talk to us

Similar Papers

More From: Physica Scripta

Lead the way for us

Journal: Physica Scripta	Publication Date: May 23, 2024
License type: iop-standard

Similar Papers

Flatness Prediction of Cold Rolled Strip Based on Deep Neural Network with Improved Activation Function.
Jingyi Liu ... Sen Li
Sensors | VOL. 22
Jingyi Liu, et. al.Jingyi Liu ... Sen Li
15 Jan 2022
Sensors | VOL. 22

On random matrices arising in deep neural networks: General I.I.D. case
Leonid Pastur ... Victor Slavin
Random Matrices: Theory and Applications | VOL. 12
Leonid Pastur, et. al.Leonid Pastur ... Victor Slavin
14 Jul 2022
Random Matrices: Theory and Applications | VOL. 12

Refining the Efficiency of R-CNN in Pedestrian Detection
Katleho L Masita ... Thokozani Shongwe
-
Katleho L Masita, et. al.Katleho L Masita ... Thokozani Shongwe
10 Sep 2021
10 Sep 2021

Fine Tuned Deep Neural Networks for Intrusion Detection System
D P Gaikwad ... Amir Mukeri
Journal of Network Security Computer Networks | VOL. 06
D P Gaikwad, et. al.D P Gaikwad ... Amir Mukeri
06 Jun 2020
Journal of Network Security Computer Networks | VOL. 06

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A simple theory for training response of deep neural networks

Abstract

Talk to us

Similar Papers

More From: Physica Scripta