Abstract

The Markov blanket (MB) in Bayesian networks has attracted much attention since the MB of a target attribute (T) is the minimal feature subset with maximum prediction ability for classification. Nevertheless, traditional MB discovery methods such as IAMB, HITON-MB, and MMMB are not suitable for streaming features. Meanwhile, online feature selection with streaming features (OSFSF) methods such as Alpha-investing and SAOLA, focus only on the relevance and ignores causality of the features, they cannot mine the MB, or only find the parents and children (PC) such as OSFS. Therefore, these methods have weaker interpretability and do not have sufficient prediction accuracy. We propose a novel algorithm for online markov blanket discovery with streaming features to tackle abovementioned issues, named OMBSF. OMBSF finds the MB of T containing parents, children, and spouses, discards false positives from the PC and spouse sets online, distinguishes between the PC and spouse in real-time. An empirical study demonstrates that OMBSF finds a more accurate MB when the volume of features is high and it significantly improves the prediction accuracy than other algorithms. Moreover, OMBSF obtains a bigger feature subset than that obtained by OSFS, demonstrating that OMBSF can identify the spouse in the MB that are not identified using OSFS.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.