A robust estimator of mutual information for deep learning interpretability

Davide Piras,Andrew Pontzen,Ningyuan Guo,Luisa Lucie-Smith,Hiranya V Peiris,Brian Nord

doi:10.1088/2632-2153/acc444

Abstract

We develop the use of mutual information (MI), a well-established metric in information theory, to interpret the inner workings of deep learning (DL) models. To accurately estimate MI from a finite number of samples, we present GMM-MI (pronounced ‘Jimmie’), an algorithm based on Gaussian mixture models that can be applied to both discrete and continuous settings. GMM-MI is computationally efficient, robust to the choice of hyperparameters and provides the uncertainty on the MI estimate due to the finite sample size. We extensively validate GMM-MI on toy data for which the ground truth MI is known, comparing its performance against established MI estimators. We then demonstrate the use of our MI estimator in the context of representation learning, working with synthetic data and physical datasets describing highly non-linear processes. We train DL models to encode high-dimensional data within a meaningful compressed (latent) representation, and use GMM-MI to quantify both the level of disentanglement between the latent variables, and their association with relevant physical quantities, thus unlocking the interpretability of the latent representation. We make GMM-MI publicly available in this GitHub repository.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Machine Learning: Science and Technology	Publication Date: Apr 11, 2023
Citations: 10	License type: cc-by

R Discovery Prime

R Discovery Prime

A robust estimator of mutual information for deep learning interpretability

Abstract

Talk to us

Similar Papers

More From: Machine Learning: Science and Technology

Lead the way for us

Similar Papers

On the Calculation of Mutual Information
Tyrone E Duncan
SIAM Journal on Applied Mathematics | VOL. 19
Tyrone E DuncanTyrone E Duncan
01 Jul 1970
SIAM Journal on Applied Mathematics | VOL. 19

Multifeature mutual information
Dejan Tomazevic ... Bostjan Likar
-
Dejan Tomazevic, et. al.Dejan Tomazevic ... Bostjan Likar
12 May 2004
12 May 2004

Estimation of mutual information by the fuzzy histogram
Maryam Amir Haeri ... Mohammad Mehdi Ebadzadeh
Fuzzy Optimization and Decision Making | VOL. 13
Maryam Amir Haeri, et. al.Maryam Amir Haeri ... Mohammad Mehdi Ebadzadeh
13 Feb 2014
Fuzzy Optimization and Decision Making | VOL. 13

Neural Mutual Information Estimation for Channel Coding: State-of-the-Art Estimators, Analysis, and Performance Comparison
Rick Fritschek ... Gerhard Wunder
-
Rick Fritschek, et. al.Rick Fritschek ... Gerhard Wunder
01 May 2020
01 May 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A robust estimator of mutual information for deep learning interpretability

Abstract

Talk to us

Similar Papers

More From: Machine Learning: Science and Technology