Ensemble Estimation of Generalized Mutual Information With Applications to Genomics

Kevin R Moon,Alfred O Hero,Kumar Sricharan

doi:10.1109/tit.2021.3100108

Abstract

Mutual information is a measure of the dependence between random variables that has been used successfully in myriad applications in many fields. Generalized mutual information measures that go beyond classical Shannon mutual information have also received much interest in these applications. We derive the mean squared error convergence rates of kernel density-based plug-in estimators of general mutual information measures between two multidimensional random variables $\mathbf{X}$ and $\mathbf{Y}$ for two cases: 1) $\mathbf{X}$ and $\mathbf{Y}$ are continuous; 2) $\mathbf{X}$ and $\mathbf{Y}$ may have any mixture of discrete and continuous components. Using the derived rates, we propose an ensemble estimator of these information measures called GENIE by taking a weighted sum of the plug-in estimators with varied bandwidths. The resulting ensemble estimators achieve the $1/N$ parametric mean squared error convergence rate when the conditional densities of the continuous variables are sufficiently smooth. To the best of our knowledge, this is the first nonparametric mutual information estimator known to achieve the parametric convergence rate for the mixture case, which frequently arises in applications (e.g. variable selection in classification). The estimator is simple to implement and it uses the solution to an offline convex optimization problem and simple plug-in estimators. A central limit theorem is also derived for the ensemble estimators and minimax rates are derived for the continuous case. We demonstrate the ensemble estimator for the mixed case on simulated data and apply the proposed estimator to analyze gene relationships in single cell data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Information Theory	Publication Date: Sep 1, 2021
Citations: 5	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Ensemble Estimation of Generalized Mutual Information With Applications to Genomics

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Information Theory

Lead the way for us

Similar Papers

Ensemble estimation of mutual information
Kevin R Moon ... Kumar Sricharan
-
Kevin R Moon, et. al.Kevin R Moon ... Kumar Sricharan
01 Jun 2017
01 Jun 2017

Some Convex Functions Based Measures of Independence and Their Application to Strange Attractor Reconstruction
Yang Chen ... Kazuyuki Aihara
Entropy | VOL. 13
Yang Chen, et. al.Yang Chen ... Kazuyuki Aihara
08 Apr 2011
Entropy | VOL. 13

Estimation of Interclass Correlation from Familial Data
B Rosner ... C H Hennekens
Applied Statistics | VOL. 26
B Rosner, et. al.B Rosner ... C H Hennekens
01 Jan 1976
Applied Statistics | VOL. 26

Ensemble estimation and variable selection with semiparametric regression models.
Sunyoung Shin ... Jason P Fine
Biometrika | VOL. 107
Sunyoung Shin, et. al.Sunyoung Shin ... Jason P Fine
15 Apr 2020
Biometrika | VOL. 107

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ensemble Estimation of Generalized Mutual Information With Applications to Genomics

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Information Theory