Abstract

Over recent decades, the rapid growth in data has made ever more urgent the quest for highly scalable Bayesian network classifiers with better classification performance and expressivity (that is, the capacity to describe the dependence relationships between attributes in different situations). To reduce the search space of possible attribute orders, the k-dependence Bayesian classifier (KDB) simply applies mutual information to sort attributes. This sorting strategy is very efficient, but it neglects the conditional dependencies between attributes and is therefore sub-optimal. In this paper, we propose a novel sorting strategy and extend KDB from a single restricted network to unrestricted ensemble networks, i.e., the unrestricted k-dependence Bayesian classifier (UKDB), in terms of Markov blanket analysis and target learning. Target learning is a framework that takes each unlabeled testing instance as a target and builds a specific Bayesian network classifier (BNC) for it, to complement the BNC learned from the training data. UKDB accordingly comprises two sub-models: one flexibly describes how the dependence relationships change across different testing instances, and the other captures the robust dependence relationships implicated in the training data. Both use UKDB as the base classifier and apply the same learning strategy while modeling different parts of the data space, so they are complementary in nature. Extensive experimental results on the Wisconsin breast cancer (WBC) database as a case study and on 10 other datasets, involving classifiers of different structural complexity such as Naive Bayes (0-dependence), Tree-Augmented Naive Bayes (1-dependence) and KDB (arbitrary k-dependence), demonstrate the effectiveness and robustness of the proposed approach.
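As a rough illustration of the MI-based sorting that KDB uses (and that the proposed approach refines), the following Python sketch ranks attributes by their mutual information with the class and then lets each attribute select up to k parents, by conditional mutual information given the class, from the attributes ranked before it. This is a minimal sketch assuming discrete attributes stored in a 2-D NumPy array; the function names and data layout are illustrative, not taken from the paper.

```python
import numpy as np
from collections import Counter

def mi(x, y):
    """Mutual information I(X;Y) for two discrete 1-D sequences."""
    n = len(x)
    pxy, px, py = Counter(zip(x, y)), Counter(x), Counter(y)
    return sum((c / n) * np.log((c * n) / (px[a] * py[b]))
               for (a, b), c in pxy.items())

def cmi(x, y, z):
    """Conditional mutual information I(X;Y|Z) for discrete sequences."""
    n = len(x)
    pxyz = Counter(zip(x, y, z))
    pxz, pyz, pz = Counter(zip(x, z)), Counter(zip(y, z)), Counter(z)
    return sum((c / n) * np.log((c * pz[zv]) / (pxz[(a, zv)] * pyz[(b, zv)]))
               for (a, b, zv), c in pxyz.items())

def kdb_structure(X, y, k=2):
    """KDB-style structure: sort attributes by I(Xi; C), then let each
    attribute take up to k parents (besides the class) among the
    higher-ranked attributes, chosen by I(Xi; Xj | C)."""
    d = X.shape[1]
    order = sorted(range(d), key=lambda i: mi(X[:, i], y), reverse=True)
    parents = {}
    for pos, i in enumerate(order):
        earlier = sorted(order[:pos],
                         key=lambda j: cmi(X[:, i], X[:, j], y),
                         reverse=True)
        parents[i] = earlier[:k]
    return order, parents

# Example with random discrete data: 4 attributes, binary class.
rng = np.random.default_rng(0)
X = rng.integers(0, 3, size=(200, 4))
y = rng.integers(0, 2, size=200)
print(kdb_structure(X, y, k=2))
```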

Highlights

  • Since 1995, researchers have proposed to embed machine-learning techniques into computer-aided systems, such as medical diagnosis systems [1,2,3,4]

  • For testing data, UKDBP provides a natural way to deal with missing values, by simply not considering the dependence relationships related to the missing values

  • Since the performance of the unrestricted k-dependence Bayesian classifier (UKDB) depends on the effectiveness of MI and CMI, we use two further criteria, pointwise mutual information (PMI) and pointwise conditional mutual information (PCMI), for comparison and to show in which situations MI and CMI are more effective


Summary

Introduction

Since 1995, researchers have proposed to embed machine-learning techniques into computer-aided systems, such as medical diagnosis systems [1,2,3,4]. When P(x_i, x_j|c) < P(x_i|c)P(x_j|c), or equivalently log(P(x_i, x_j|c) / (P(x_i|c)P(x_j|c))) < 0, we have I(x_i; x_j|c) < 0, and we argue that the relationship between the attribute values x_i and x_j can be considered one of conditional independence. The existence of negative values of I(x_1; x_2|c) that represent conditional independence means the dependence relationship may differ, rather than remain invariant, when attributes take different values. General BNCs (such as NB, TAN and KDB), which build only one model to fit the training instances, cannot capture this difference and cannot represent the dependence relationships flexibly. Taheri et al. [23] proposed to build a dynamic structure without specifying k a priori, and they proved that the resulting BNC is optimal.
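As a concrete, purely illustrative check of the sign argument above, the short Python sketch below evaluates log(P(x_1, x_2|c) / (P(x_1|c)P(x_2|c))) for every value pair of two binary attributes under a toy conditional distribution: some value pairs yield negative pointwise values, which by the argument above can be treated as conditionally independent, even though the averaged conditional mutual information stays non-negative. The numbers are invented for illustration and are not from the paper.

```python
import numpy as np

# Toy joint distribution of two binary attributes given a class value c
# (illustrative numbers only).
p_xy_c = np.array([[0.10, 0.30],    # P(x_1=0, x_2=0 | c), P(x_1=0, x_2=1 | c)
                   [0.40, 0.20]])   # P(x_1=1, x_2=0 | c), P(x_1=1, x_2=1 | c)
p_x_c = p_xy_c.sum(axis=1)          # marginal P(x_1 | c)
p_y_c = p_xy_c.sum(axis=0)          # marginal P(x_2 | c)

# Pointwise conditional mutual information for every value pair:
# log( P(x_1, x_2 | c) / (P(x_1 | c) * P(x_2 | c)) )
pcmi = np.log(p_xy_c / np.outer(p_x_c, p_y_c))
print(pcmi)          # the (0,0) and (1,1) cells are negative for these numbers

# Averaging the pointwise values over the joint distribution gives the
# (always non-negative) conditional mutual information for this class value.
print((p_xy_c * pcmi).sum())   # about 0.086 >= 0
```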

The UKDB Algorithm
Results and Discussion
Evaluation Function
Experimental Study on WBC Dataset
The Effect of Values of k
The Effect of Missing Values
Results without Missing Values
Conclusions