Mutual Information between Discrete and Continuous Data Sets

Brian C Ross

doi:10.1371/journal.pone.0087357

Abstract

Mutual information (MI) is a powerful method for detecting relationships between data sets. There are accurate methods for estimating MI that avoid problems with “binning” when both data sets are discrete or when both data sets are continuous. We present an accurate, non-binning MI estimator for the case of one discrete data set and one continuous data set. This case applies when measuring, for example, the relationship between base sequence and gene expression level, or the effect of a cancer drug on patient survival time. We also show how our method can be adapted to calculate the Jensen–Shannon divergence of two or more data sets.

Highlights

Mutual information (MI) [1] is in several ways a perfect statistic for measuring the degree of relatedness between data sets.
MI will detect any sort of relationship between data sets whatsoever, whether it involves the mean values, the variances, or higher moments.
MI has a straightforward interpretation as the amount of information shared between data sets; other statistics, such as rank ordering, are harder to interpret.
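
For reference, the textbook definition of MI for the mixed case treated here, with a discrete variable X and a continuous variable Y, can be written as below. The notation (p(x) for the label probabilities, \mu(y | x) for the conditional densities) is assumed for illustration and is not taken from the text above.

```latex
I(X;Y) \;=\; \sum_{x} p(x) \int \mu(y \mid x)\,
        \log \frac{\mu(y \mid x)}{\mu(y)} \, dy ,
\qquad
\mu(y) \;=\; \sum_{x} p(x)\, \mu(y \mid x)
```

I(X;Y) is zero exactly when \mu(y | x) = \mu(y) for every x with p(x) > 0, that is, when X and Y are independent. Any dependence, whether in the mean, the variance, or a higher moment, makes it positive.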

Summary

Introduction

Mutual information (MI) [1] is in several ways a perfect statistic for measuring the degree of relatedness between data sets. Our method can also be applied to estimate the weighted Jensen–Shannon (JS) divergence: samples from every distribution being compared are stored in the continuous data set Y, and the discrete data set X records which distribution each sample was drawn from.
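
The labelling trick described above can be sketched in a few lines of Python. This is a minimal illustration, not the author's code: it assumes a k-nearest-neighbor estimate of the form I = <psi(N) - psi(N_x) + psi(k) - psi(m)>, restricts Y to scalar values, and uses a brute-force O(N^2) neighbor search. The function names `digamma_int`, `mi_discrete_continuous` and `js_divergence` are hypothetical.

```python
import math
import random

EULER_GAMMA = 0.5772156649015329


def digamma_int(n):
    """Digamma function psi(n) for a positive integer n: -gamma + H_{n-1}."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return -EULER_GAMMA + sum(1.0 / j for j in range(1, n))


def mi_discrete_continuous(x, y, k=3):
    """Nearest-neighbor MI estimate (in nats) between labels x and scalars y.

    Assumed form of the estimator: the average over all points of
    psi(N) - psi(N_x) + psi(k) - psi(m).
    """
    n = len(x)
    if n != len(y):
        raise ValueError("x and y must have the same length")
    groups = {}
    for label, value in zip(x, y):
        groups.setdefault(label, []).append(value)
    if any(len(values) <= k for values in groups.values()):
        raise ValueError("every label needs more than k samples")

    total = 0.0
    for label, value in zip(x, y):
        same = groups[label]
        # Distance to the k-th nearest neighbor that shares this label.
        # Index 0 of the sorted list is the point itself (distance 0).
        d = sorted(abs(value - v) for v in same)[k]
        # Points of the full data set within distance d, excluding the point itself.
        m = sum(1 for v in y if abs(value - v) <= d) - 1
        total += (digamma_int(n) - digamma_int(len(same))
                  + digamma_int(k) - digamma_int(m))
    return total / n


def js_divergence(*samples, k=3):
    """Weighted JS divergence estimate for two or more lists of samples.

    Y pools all samples; X holds the index of the list each sample came
    from, so the weights are the fractions of samples in each list.
    """
    x = [i for i, s in enumerate(samples) for _ in s]
    y = [v for s in samples for v in s]
    return mi_discrete_continuous(x, y, k=k)


if __name__ == "__main__":
    rng = random.Random(0)
    a = [rng.gauss(0.0, 1.0) for _ in range(200)]
    b = [rng.gauss(10.0, 1.0) for _ in range(200)]
    c = [rng.gauss(0.0, 1.0) for _ in range(200)]
    print(js_divergence(a, b))  # well-separated distributions: near ln 2
    print(js_divergence(a, c))  # same distribution: near 0
```

With equal sample counts the weights are equal, so two non-overlapping distributions should give an estimate near ln 2 (about 0.69 nats), and two samples from the same distribution an estimate near zero. For real data sets a tree-based neighbor search would be preferable to the quadratic loop used here.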

Journal: PLoS ONE
Publication Date: Feb 19, 2014
Citations: 599
License type: CC BY 4.0

Similar Papers

Calculator for Mutual Information Between a Discrete and a Continuous Data Set
Brian Ross
Biophysical Journal | VOL. 106
01 Jan 2014

A method for continuous-range sequence analysis with Jensen-Shannon divergence
Miguel Ángel Ré ... Guillermo Gabriel Aguirre Varela
Papers in Physics | VOL. 13
05 Feb 2021

Obstructive sleep apnea predicts 10-year cardiovascular disease-related mortality in the Sleep Heart Health Study: a machine learning approach.
Ao Li ... Linda S Powers
Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine | VOL. 18
26 Aug 2021

Biologically inspired informatics; algorithm for discrete data and signal processing
Sandor J Piros ... Peter Korondi
01 Jul 2011
