Abstract
Background
One of the goals of the Systems Biology community is to have a detailed map of all biological interactions in an organism. One small yet important step in this direction is the creation of biological networks from post-genomic data. Bayesian networks are a very promising model for the inference of regulatory networks in Systems Biology. Usually, Bayesian networks are sampled with a Markov Chain Monte Carlo (MCMC) sampler in the structure space. Unfortunately, conventional MCMC sampling schemes are often slow in mixing and convergence. To improve MCMC convergence, an alternative method is proposed and tested with different sets of data. Moreover, the proposed method is compared with the traditional MCMC sampling scheme.
Results
In the proposed method, a simpler and faster method for the inference of regulatory networks, Graphical Gaussian Models (GGMs), is integrated into the Bayesian network inference through a hierarchical Bayesian model. In this manner, information about the structure obtained from the data with GGMs is taken into account in the MCMC scheme, thus improving mixing and convergence. The proposed method is tested with three types of data, two from simulated models and one from real data. The results are compared with those of the traditional MCMC sampling scheme in terms of network recovery accuracy and convergence. Compared with a traditional MCMC scheme, the proposed method shows improved convergence, leading to better network reconstruction with fewer MCMC iterations.
Conclusions
The proposed method is a viable alternative for improving the mixing and convergence of traditional MCMC schemes. It allows the use of Bayesian networks with an MCMC sampler with fewer iterations. The proposed method always converged earlier than the traditional MCMC scheme. We observe an improvement in the accuracy of the recovered networks for the Gaussian simulated data, but this improvement is absent for both the real data and the data simulated from ODEs.
Electronic supplementary material
The online version of this article (doi:10.1186/s12859-015-0734-6) contains supplementary material, which is available to authorized users.
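To make the GGM ingredient mentioned in the abstract more concrete, the following is a minimal sketch of how a Graphical Gaussian Model scores edges through partial correlations, computed here from the inverse of the sample covariance matrix. The function name `partial_correlations`, the toy chain data, and the use of a plain matrix inverse are assumptions for illustration only; the estimator actually used in the paper may differ (for example, it may regularize the covariance when samples are scarce).

```python
import numpy as np


def partial_correlations(data):
    """Return the matrix of pairwise partial correlations.

    `data` has shape (n_samples, n_variables). This sketch assumes
    n_samples > n_variables, so the sample covariance is invertible.
    """
    cov = np.cov(data, rowvar=False)       # sample covariance matrix
    precision = np.linalg.inv(cov)         # precision (inverse covariance)
    scale = np.sqrt(np.diag(precision))
    # Standard GGM relation: rho_ij = -omega_ij / sqrt(omega_ii * omega_jj)
    pcor = -precision / np.outer(scale, scale)
    np.fill_diagonal(pcor, 1.0)
    return pcor


# Hypothetical toy data: a chain X0 -> X1 -> X2 with Gaussian noise.
rng = np.random.default_rng(0)
x0 = rng.normal(size=2000)
x1 = x0 + 0.5 * rng.normal(size=2000)
x2 = x1 + 0.5 * rng.normal(size=2000)
pcor = partial_correlations(np.column_stack([x0, x1, x2]))
```

In this toy chain, X0 and X2 are conditionally independent given X1, so their partial correlation should be close to zero while the two direct links keep large values. A structure prior informed by such scores is one way GGM information could guide an MCMC sampler, as the abstract describes at a high level.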
Highlights
One of the goals of the Systems Biology community is to have a detailed map of all molecular interactions in an organism
The property that each node is conditionally independent of its non-descendants given its parents yields a simple and unique rule for expanding the joint probability in terms of simpler conditional probabilities. In accordance with this property, a Bayesian Network (BN) must be a directed acyclic graph (DAG)
We have introduced the proposed hierarchical Bayesian model, BNGGM, and its sampling scheme
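The conditional-independence property stated in the highlights above corresponds to the standard factorization of the joint distribution of a Bayesian network. As a brief illustration, with $\mathrm{pa}(X_i)$ denoting the set of parents of node $X_i$ in the DAG (this notation is assumed here, not taken from the source):

```latex
P(X_1, \dots, X_n) \;=\; \prod_{i=1}^{n} P\bigl(X_i \mid \mathrm{pa}(X_i)\bigr)
```

For a three-node chain $X_1 \to X_2 \to X_3$, this gives $P(X_1, X_2, X_3) = P(X_1)\,P(X_2 \mid X_1)\,P(X_3 \mid X_2)$. The product is a valid expansion only when the graph has no directed cycles, which is why the DAG requirement is needed.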
Summary
One of the goals of the Systems Biology community is to have a detailed map of all biological interactions in an organism. Bayesian networks are a very promising model for the inference of regulatory networks in Systems Biology. To improve MCMC convergence, an alternative method is proposed and tested with different sets of data. The proposed method is compared with the traditional MCMC sampling scheme. Although much work remains to map all molecular interactions in an organism, the inference of biological networks has become an important tool in Systems Biology. In the last few years, several methods for the reconstruction of regulatory networks and biochemical pathways from data have been proposed; see, for instance, [1,2,3,4]. Among the various approaches for inferring networks, Bayesian Networks (BNs) are very attractive due to their probabilistic nature and their flexibility in incorporating interventions and extra sources of information.