Information Bottleneck Theory Based Exploration of Cascade Learning.

Xin Du, Mahesan Niranjan, Katayoun Farrahi

doi:10.3390/e23101360

Abstract

In solving challenging pattern recognition problems, deep neural networks have shown excellent performance by forming powerful mappings between inputs and targets, learning representations (features) and making subsequent predictions. A recent tool to help understand how representations are formed is based on observing the dynamics of learning on an information plane using mutual information, linking the input to the representation, I(X; T), and the representation to the target, I(T; Y). In this paper, we use an information-theoretic approach to understand how Cascade Learning (CL), a method to train deep neural networks layer-by-layer, learns representations, as CL has shown comparable results while saving computation and memory costs. We observe that performance is not linked to information compression, which differs from observations on End-to-End (E2E) learning. Additionally, CL can inherit information about targets and gradually specialise extracted features layer-by-layer. We evaluate this effect by proposing an information transition ratio and show that it can serve as a useful heuristic in setting the depth of a neural network that achieves satisfactory classification accuracy.
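As a rough illustration of the two information-plane coordinates mentioned in the abstract, the sketch below estimates the mutual information between the input and a representation, I(X; T), and between the representation and the target, I(T; Y), using a simple plug-in (histogram) estimator on discrete samples. This is not the authors' estimator: the function names `discrete_mutual_information` and `discretise_rows`, the binning scheme, and the toy data are all assumptions made for illustration. The final ratio I(T; Y) / I(X; T) is likewise only an assumed stand-in, since the abstract does not give the definition of the information transition ratio.

```python
import numpy as np


def discrete_mutual_information(a, b):
    """Plug-in estimate of I(A; B) in bits from paired discrete samples."""
    a = np.asarray(a).ravel()
    b = np.asarray(b).ravel()
    # Map each distinct value to a contiguous integer index.
    _, a_idx = np.unique(a, return_inverse=True)
    _, b_idx = np.unique(b, return_inverse=True)
    # Build the empirical joint distribution by counting co-occurrences.
    joint = np.zeros((a_idx.max() + 1, b_idx.max() + 1))
    np.add.at(joint, (a_idx, b_idx), 1.0)
    joint /= joint.sum()
    p_a = joint.sum(axis=1, keepdims=True)  # marginal of A, shape (n, 1)
    p_b = joint.sum(axis=0, keepdims=True)  # marginal of B, shape (1, m)
    nonzero = joint > 0
    ratio = joint[nonzero] / (p_a @ p_b)[nonzero]
    return float(np.sum(joint[nonzero] * np.log2(ratio)))


def discretise_rows(activations, n_bins=4):
    """Bin each activation, then give every distinct binned row one symbol.

    Hypothetical helper: equal-width binning is one common choice, not
    necessarily the one used in the paper.
    """
    t = np.asarray(activations, dtype=float)
    edges = np.linspace(t.min(), t.max(), n_bins + 1)[1:-1]
    codes = np.digitize(t, edges)
    _, row_ids = np.unique(codes, axis=0, return_inverse=True)
    return row_ids.ravel()


# Toy data (assumed): 8 distinct inputs, a binary target, and a
# representation that merges neighbouring inputs but keeps the label.
x = np.arange(8)
y = (x >= 4).astype(int)
t = x // 2

i_xt = discrete_mutual_information(x, t)  # 2 bits: T keeps 4 of 8 input states
i_ty = discrete_mutual_information(t, y)  # 1 bit: T still determines Y
assumed_ratio = i_ty / i_xt               # assumed definition, see lead-in

# Continuous activations must be discretised before using the estimator.
layer_output = np.array([[0.0, 1.0], [0.1, 0.9], [1.0, 0.0], [0.9, 0.1]])
symbols = discretise_rows(layer_output, n_bins=2)
```

Plug-in estimates of this kind depend heavily on the number of bins and on sample size, so values from different layers are only comparable when the same discretisation is applied throughout.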

Journal: Entropy (Basel, Switzerland)
Publication Date: Oct 18, 2021
Citations: 1
License type: CC BY 4.0

Similar Papers

Transfer learning across human activities using a cascade neural network architecture
Xin Du ... Katayoun Farrahi
09 Sep 2019

A convergence analysis of Nesterov’s accelerated gradient method in training deep linear neural networks
Xin Liu ... Zhisong Pan
Information Sciences | VOL. 612
05 Sep 2022

A Deep Representation Learning Framework for Medical Imaging Data Analysis
Pengcheng Xi
24 Jun 2020

Analysis of Deep Convolutional Neural Networks Using Tensor Kernels and Matrix-Based Entropy.
Kristoffer K Wickstrøm ... Sigurd Løkse
Entropy (Basel, Switzerland) | VOL. 25
03 Jun 2023
