Abstract
We consider a nonparametric Generative Tree Model and discuss the problem of selecting active predictors for the response in this scenario. We investigate two popular information-based selection criteria: Conditional Infomax Feature Extraction (CIFE) and Joint Mutual Information (JMI), both derived as approximations of the Conditional Mutual Information (CMI) criterion. We show that CIFE and JMI may behave differently from CMI, resulting in different orders in which predictors are chosen in the variable selection process. Explicit formulae for CMI and its two approximations in the Generative Tree Model are obtained. As a byproduct, we establish expressions for the entropy of a multivariate Gaussian mixture and its mutual information with the mixing distribution.
Highlights
In the paper, we consider theoretical properties of Conditional Mutual Information (CMI) and its approximations in a certain dependence model called Generative Tree Model (GTM)
We prove some results on information-theoretic properties of Gaussian mixtures which are necessary to analyze the behavior of CMI, Conditional Infomax Feature Extraction (CIFE), and Joint Mutual Information (JMI) in Generative Tree Models
We define a special Gaussian Generative Tree Model and investigate how the greedy procedure based on (14), as well as its analogues in which CMI is replaced by JMI and CIFE, behaves in this model
Summary
We consider theoretical properties of Conditional Mutual Information (CMI) and its approximations in a certain dependence model called the Generative Tree Model (GTM). CMI and its modifications are used in many machine learning problems, including feature selection, variable importance ranking, causal discovery, and structure learning of dependence networks (see, e.g., References [1,2]). We stress that our approach is intrinsically nonparametric and focuses on using nonparametric measures of conditional dependence for feature selection. By studying their theoretical behavior for this task, we learn the average behavior of their empirical counterparts for large sample sizes. Besides its explainable dependence structure, the distributions of predictors in the considered model are Gaussian mixtures, which facilitates the calculation of explicit forms of the information-based selection criteria.
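To make the greedy CMI-based selection procedure concrete, the following is a minimal sketch for discrete data using a plug-in (empirical) CMI estimator. The function names and the estimator are illustrative and not taken from the paper; the paper's analysis concerns the population quantities in the Gaussian GTM, not this empirical version.

```python
import numpy as np
from collections import Counter

def entropy(rows):
    """Plug-in (empirical) entropy, in nats, of the joint distribution of the rows."""
    counts = Counter(map(tuple, np.asarray(rows)))
    p = np.array(list(counts.values())) / len(rows)
    return float(-np.sum(p * np.log(p)))

def cmi(x, y, z):
    """Plug-in estimate of I(X; Y | Z) = H(X,Z) + H(Y,Z) - H(X,Y,Z) - H(Z)."""
    xz = np.column_stack([x, z])
    yz = np.column_stack([y, z])
    xyz = np.column_stack([x, y, z])
    return entropy(xz) + entropy(yz) - entropy(xyz) - entropy(z)

def greedy_cmi_selection(X, y, k):
    """Greedily add the predictor maximizing CMI with y given those already chosen."""
    n, d = X.shape
    selected = []
    for _ in range(k):
        z = X[:, selected]  # conditioning set: columns chosen so far
        remaining = [j for j in range(d) if j not in selected]
        best = max(remaining, key=lambda j: cmi(X[:, j], y, z))
        selected.append(best)
    return selected

# Illustrative usage on synthetic data: y depends on columns 0 and 1; column 2 is noise.
rng = np.random.default_rng(0)
X = rng.integers(0, 2, size=(500, 3))
y = X[:, 0] + X[:, 1]
print(greedy_cmi_selection(X, y, 2))  # the two active predictors, columns 0 and 1
```

The CIFE and JMI analogues studied in the paper replace the CMI score in the `max` step with their respective lower-order approximations built from pairwise (conditional) mutual informations.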