Abstract
We propose a learning method for building a general statistical inference engine, operating on discrete feature spaces. Such a model allows inference on any feature given values for the other features (or for a feature subset). Bayesian networks (BNs) are versatile tools that possess this inference capability. However, while the BN's explicit representation of conditional independencies is informative, this structure is not so easily learned. Typically, learning methods for BNs use (suboptimal) greedy search techniques. There is also the difficult issue of overfitting in these models. Alternatively, Cheeseman (1983) proposed finding the maximum entropy (ME) joint probability mass function (pmf) consistent with arbitrary lower-order probability constraints. This approach has some potential advantages over BNs. However, the huge complexity of learning the joint pmf has severely limited the use of this approach until now. Here we propose an approximate ME method which also allows incorporation of arbitrary lower-order constraints, yet retains quite tractable learning complexity. The new method approximates the joint feature pmf (during learning) on a subgrid of the full feature space grid. Experimental results on the UC-Irvine repository reveal significant performance gains over two BN approaches: Chow and Liu's (1968) dependence trees and Herskovits and Cooper's (1991) Kutato. Several extensions of our approach are indicated.
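As a concrete illustration of the maximum entropy idea the abstract refers to (not the paper's approximate subgrid method), the sketch below uses iterative proportional fitting to recover the ME joint pmf consistent with a set of lower-order (here, pairwise) marginal constraints on a small binary feature grid. The number of features, the choice of constrained feature pairs, and the constraint tables are illustrative assumptions only.

    # Minimal sketch, assuming a toy 3-feature binary space and two hypothetical
    # pairwise marginal constraints. Starting from the uniform pmf, iterative
    # proportional fitting (IPF) converges to the maximum-entropy joint pmf that
    # satisfies the given lower-order constraints.
    import numpy as np

    n_features = 3                                   # 2^3 = 8 cells in the full grid
    joint = np.full((2,) * n_features, 1.0 / 2**n_features)   # uniform start (max entropy)

    # Hypothetical target marginals P(X_i, X_j), indexed [x_i, x_j]
    constraints = {
        (0, 1): np.array([[0.30, 0.20],
                          [0.10, 0.40]]),
        (1, 2): np.array([[0.25, 0.15],
                          [0.35, 0.25]]),
    }

    for _ in range(200):                             # IPF sweeps
        for (i, j), target in constraints.items():
            axes = tuple(k for k in range(n_features) if k not in (i, j))
            current = joint.sum(axis=axes)           # current marginal over (X_i, X_j)
            ratio = np.where(current > 0, target / current, 0.0)
            # Broadcast the correction back over the full grid and rescale the joint
            expand = tuple(slice(None) if k in (i, j) else None for k in range(n_features))
            joint = joint * ratio[expand]

    print("joint sums to", joint.sum())
    print("fitted P(X0, X1):\n", joint.sum(axis=2))

The abstract's point is that doing this exactly over the full feature space grid scales exponentially in the number of features; the proposed method instead approximates the joint pmf on a subgrid during learning to keep complexity tractable.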