Analysis of Deep Convolutional Neural Networks Using Tensor Kernels and Matrix-Based Entropy.

Kristoffer K Wickstrøm,Robert Jenssen,Michael C Kampffmeyer,José C Príncipe,Shujian Yu,Sigurd Løkse

doi:10.3390/e25060899

Abstract

Analyzing deep neural networks (DNNs) via information plane (IP) theory has gained tremendous attention recently to gain insight into, among others, DNNs' generalization ability. However, it is by no means obvious how to estimate the mutual information (MI) between each hidden layer and the input/desired output to construct the IP. For instance, hidden layers with many neurons require MI estimators with robustness toward the high dimensionality associated with such layers. MI estimators should also be able to handle convolutional layers while at the same time being computationally tractable to scale to large networks. Existing IP methods have not been able to study truly deep convolutional neural networks (CNNs). We propose an IP analysis using the new matrix-based Rényi's entropy coupled with tensor kernels, leveraging the power of kernel methods to represent properties of the probability distribution independently of the dimensionality of the data. Our results shed new light on previous studies concerning small-scale DNNs using a completely new approach. We provide a comprehensive IP analysis of large-scale CNNs, investigating the different training phases and providing new insights into the training dynamics of large-scale neural networks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Analysis of Deep Convolutional Neural Networks Using Tensor Kernels and Matrix-Based Entropy.

Abstract

Talk to us

Similar Papers

More From: Entropy (Basel, Switzerland)

Lead the way for us

Journal: Entropy (Basel, Switzerland)	Publication Date: Jun 3, 2023
License type: CC BY 4.0

Similar Papers

Deep distributed convolutional neural networks: Universality
Ding-Xuan Zhou
Analysis and Applications | VOL. 16
Ding-Xuan ZhouDing-Xuan Zhou
01 Nov 2018
Analysis and Applications | VOL. 16

Scalable Mutual Information Estimation Using Dependence Graphs
Morteza Noshad ... Alfred O Hero
-
Morteza Noshad, et. al.Morteza Noshad ... Alfred O Hero
01 May 2019
01 May 2019

Convergence Behavior of DNNs with Mutual-Information-Based Regularization.
Hlynur Jónsson ... Giovanni Cherubini
Entropy | VOL. 22
Hlynur Jónsson, et. al.Hlynur Jónsson ... Giovanni Cherubini
30 Jun 2020
Entropy | VOL. 22

Stress detection using deep neural networks
Russell Li ... Zhandong Liu
BMC Medical Informatics and Decision Making | VOL. 20
Russell Li, et. al.Russell Li ... Zhandong Liu
01 Dec 2020
BMC Medical Informatics and Decision Making | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analysis of Deep Convolutional Neural Networks Using Tensor Kernels and Matrix-Based Entropy.

Abstract

Talk to us

Similar Papers

More From: Entropy (Basel, Switzerland)