Learning complex dependency structure of gene regulatory networks from high dimensional microarray data with Gaussian Bayesian networks

Catharina E Graafland,José M Gutiérrez

doi:10.1038/s41598-022-21957-z

Abstract

Reconstruction of Gene Regulatory Networks (GRNs) of gene expression data with Probabilistic Network Models (PNMs) is an open problem. Gene expression datasets consist of thousand of genes with relatively small sample sizes (i.e. are large-p-small-n). Moreover, dependencies of various orders coexist in the datasets. On the one hand transcription factor encoding genes act like hubs and regulate target genes, on the other hand target genes show local dependencies. In the field of Undirected Network Models (UNMs)—a subclass of PNMs—the Glasso algorithm has been proposed to deal with high dimensional microarray datasets forcing sparsity. To overcome the problem of the complex structure of interactions, modifications of the default Glasso algorithm have been developed that integrate the expected dependency structure in the UNMs beforehand. In this work we advocate the use of a simple score-based Hill Climbing algorithm (HC) that learns Gaussian Bayesian networks leaning on directed acyclic graphs. We compare HC with Glasso and variants in the UNM framework based on their capability to reconstruct GRNs from microarray data from the benchmarking synthetic dataset from the DREAM5 challenge and from real-world data from the Escherichia coli genome. We conclude that dependencies in complex data are learned best by the HC algorithm, presenting them most accurately and efficiently, simultaneously modelling strong local and weaker but significant global connections coexisting in the gene expression dataset. The HC algorithm adapts intrinsically to the complex dependency structure of the dataset, without forcing a specific structure in advance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Nov 4, 2022
Citations: 2	License type: open-access

R Discovery Prime

R Discovery Prime

Learning complex dependency structure of gene regulatory networks from high dimensional microarray data with Gaussian Bayesian networks

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Learning the structure of gene regulatory networks from time series gene expression data
Haoni Li ... Nan Wang
BMC Genomics | VOL. 12
Haoni Li, et. al.Haoni Li ... Nan Wang
01 Dec 2011
BMC Genomics | VOL. 12

Reconstruction of dynamic regulatory networks reveals signaling-induced topology changes associated with germ layer specification.
Emily Y Su ... Qin Bian
Stem Cell Reports | VOL. 17
Emily Y Su, et. al.Emily Y Su ... Qin Bian
27 Jan 2022
Stem Cell Reports | VOL. 17

A Fuzzy Data Mining Technique for the Reconstruction of Gene Regulatory Networks from Time Series Expression Data
Patrick C H Ma ... Keith C C Chan
-
Patrick C H Ma, et. al.Patrick C H Ma ... Keith C C Chan
01 Sep 2006
01 Sep 2006

INFERENCE OF GENE REGULATORY NETWORKS FROM MICROARRAY DATA: A FUZZY LOGIC APPROACH
Patrick C.H Ma ... Keith C.C Chan
-
Patrick C.H Ma, et. al.Patrick C.H Ma ... Keith C.C Chan
01 Dec 2005
01 Dec 2005

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning complex dependency structure of gene regulatory networks from high dimensional microarray data with Gaussian Bayesian networks

Abstract

Talk to us

Similar Papers

More From: Scientific Reports