Abstract

Machine learning techniques have shown superior predictive power, among which Bayesian network classifiers (BNCs) have remained of great interest due to their capacity to represent complex dependence relationships. Most traditional BNCs build only one model to fit the training instances, analyzing independence between attributes using conditional mutual information. However, the conditional dependence relationships may differ across class labels and attribute values rather than remain invariant, which may result in classification bias. To address this issue, we propose a novel framework, called discriminatory target learning, which can be regarded as a tradeoff between a probabilistic model learned from an unlabeled instance at the uncertain end and one learned from labeled training data at the certain end. The final model can discriminately represent the dependence relationships hidden in an unlabeled instance with respect to the different possible class labels. Taking the k-dependence Bayesian classifier as an example, experimental comparison on 42 publicly available datasets indicates that the final model achieves competitive classification performance compared with state-of-the-art learners such as random forest and averaged one-dependence estimators.

Highlights

  • With the rapid development of computer technologies, business and government organizations create large amounts of data, which need to be processed and analyzed

  • We argue that most Bayesian network classifiers (BNCs) (e.g., naive Bayes (NB) and the k-dependence Bayesian classifier (KDB)), which build only one model to fit the training instances, cannot capture this difference and cannot flexibly represent the dependence relationships, especially those hidden in unlabeled instances

  • We analyzed the performance in terms of zero-one loss, root mean square error (RMSE), bias and variance on 42 natural domains from the UCI Machine Learning Repository [29]
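The metrics named above are standard and easy to compute. As a hedged sketch (helper names are my own, not from the paper), zero-one loss is the fraction of misclassified instances, and one common per-instance definition of RMSE for classifiers measures the squared shortfall of the predicted probability assigned to the true class (exact definitions vary across papers):

```python
import numpy as np

def zero_one_loss(y_true, y_pred):
    """Fraction of instances whose predicted label differs from the true label."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    return float(np.mean(y_true != y_pred))

def rmse(y_true, proba, classes):
    """Root mean square error over predicted class probabilities:
    sqrt of the mean of (1 - P(true class))^2 across instances.
    `proba` is an (n_instances, n_classes) array aligned with `classes`."""
    proba = np.asarray(proba, dtype=float)
    classes = list(classes)
    idx = np.array([classes.index(y) for y in y_true])
    p_true = proba[np.arange(len(idx)), idx]
    return float(np.sqrt(np.mean((1.0 - p_true) ** 2)))
```

For example, a classifier that misses one of three labels has zero-one loss 1/3, regardless of how confident its probability estimates were; RMSE additionally penalizes low confidence in correct predictions.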



Introduction

With the rapid development of computer technologies, business and government organizations create large amounts of data, which need to be processed and analyzed. To satisfy the urgent need of mining knowledge hidden in these data, numerous machine learning models [1,2] (e.g., decision trees [3], Bayesian networks [4,5], support vector machines [6] and neural networks [7]) have been proposed. Two strategies are commonly used to improve their predictive power: (1) Increase structure complexity to represent more dependence relationships, e.g., convolutional neural networks [8] and the k-dependence Bayesian classifier (KDB) [9]. As structure complexity grows, however, overfitting will inevitably appear, resulting in redundant dependencies and performance degradation. (2) Build an ensemble of several individual members, each with a relatively simple network structure, e.g., random forest [10] and averaged one-dependence estimators (AODE) [11].
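KDB-style structure learning ranks candidate parents for each attribute by conditional mutual information I(Xi; Xj | C) with respect to the class C. A minimal empirical estimator for discrete attributes might look like the following sketch (the function name and frequency-count approach are my own illustration, not code from the paper):

```python
import numpy as np
from collections import Counter

def conditional_mutual_information(xi, xj, c):
    """Empirical estimate of I(Xi; Xj | C) in bits, from three parallel
    sequences of discrete attribute/class values of equal length."""
    n = len(c)
    joint = Counter(zip(xi, xj, c))  # counts of (xi, xj, c) triples
    ic = Counter(zip(xi, c))         # counts of (xi, c) pairs
    jc = Counter(zip(xj, c))         # counts of (xj, c) pairs
    pc = Counter(c)                  # counts of class values
    cmi = 0.0
    for (a, b, cl), cnt in joint.items():
        # p(a,b,c) * log2( p(a,b,c) p(c) / (p(a,c) p(b,c)) );
        # the 1/n normalizations cancel inside the logarithm.
        p_abc = cnt / n
        cmi += p_abc * np.log2(cnt * pc[cl] / (ic[(a, cl)] * jc[(b, cl)]))
    return cmi
```

A KDB learner would evaluate this quantity for every attribute pair and keep, for each attribute, the k highest-scoring earlier-ranked attributes as parents; attributes that are conditionally independent given the class score near zero and are not linked.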

