Abstract

The Bayesian network (BN) is a probability inference model to describe the explicit relationship between cause and effect, which may be examined in the complex system of rice price with data uncertainty. However, discovering the optimized structure from a super-exponential number of graphs in the search space is an NP-hard problem. In this paper, Bayesian Maximal Information Coefficient (BMIC) is proposed to uncover the causal correlations from a large data set in a random system by integrating probabilistic graphical model (PGM) and maximal information coefficient (MIC) with Bayesian linear regression (BLR). First, MIC is to capture the strong dependence between predictor variables and a target variable to reduce the number of variables for the BN structural learning of PGM. Second, BLR is responsible for assigning orientation in a graph resulting from a posterior probability distribution. It conforms to what BN needs to acquire a conditional probability distribution when given the parents for each node by the Bayes’ Theorem. Third, the Bayesian information criterion (BIC) is treated as an indicator to determine the well-explained model with its data to ensure correctness. The score shows that the proposed BMIC obtains the highest score compared to the two traditional learning algorithms. Finally, the proposed BMIC is applied to discover the causal correlations from the large data set on Thai rice price by identifying the causal changes in the paddy price of Jasmine rice. The results of the experiments show that the proposed BMIC returns directional relationships with clues to identify the cause(s) and effect(s) of paddy price with a better heuristic search.

Highlights

  • E Mathematics Subject Classification (2010) 62M10 · 62P20 · 91B84 strong dependence between predictor variables and a target variable to reduce the number of variables for 1 Introduction the Bayesian network (BN) structural learning of Probabilistic Graphical Model (PGM)

  • The Bayesian maximal information coefficient (BMIC) is applied to discover the causal correlations from the large dataset on Thai rice price by identifying causality change in the paddy price of Jas-With data uncertainty, one important issue is found when data keeps growing in volume, variety, and velocity

  • Since data uncertainty can be detected in the price of Thai rice and relevant attributes, the use of BN probability inference gives a better explanation

Read more

Summary

Introduction

E Mathematics Subject Classification (2010) 62M10 · 62P20 · 91B84 strong dependence between predictor variables and a target variable to reduce the number of variables for 1 Introduction the BN structural learning of PGM. Probabilistic Graphical Model (PGM) (Marloes et al, 2013) illustrates causal relationships by probability and graph It (Koller and Friedman, 2009) is a robust framework to handle uncertainty predictions. The correlated and tuple uncertainty and their relationships are demonstrated by a joint probability distribution.(Singh et al, 2008) Bayesian network (BN) presents a directed acyclic graph (DAG) on a joint probability distribution using a conditional probability of variables from the Bayes’ theorem along with machine learning. PGM’s structured learning process encounters the NP-hard problem(Chickering, 1996) when all observed variables are required to construct the DAG It implies that the super-exponential number of graphical models is in the search space. The more graphs in the search space, the more time, memory, and cost are required to discover the best network structure

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call