KBoost: a new method to infer gene regulatory networks from gene expression data

Luis F Iglesias-Martinez,Barbara De Kegel,Walter Kolch

doi:10.1038/s41598-021-94919-6

Luis F Iglesias-Martinez, Barbara De Kegel + Show 1 more

Open Access

https://doi.org/10.1038/s41598-021-94919-6

Copy DOI

Journal: Scientific Reports	Publication Date: Jul 29, 2021
Citations: 14	License type: open-access

Affiliation: University College Dublin

Abstract

Reconstructing gene regulatory networks is crucial to understand biological processes and holds potential for developing personalized treatment. Yet, it is still an open problem as state-of-the-art algorithms are often not able to process large amounts of data within reasonable time. Furthermore, many of the existing methods predict numerous false positives and have limited capabilities to integrate other sources of information, such as previously known interactions. Here we introduce KBoost, an algorithm that uses kernel PCA regression, boosting and Bayesian model averaging for fast and accurate reconstruction of gene regulatory networks. We have benchmarked KBoost against other high performing algorithms using three different datasets. The results show that our method compares favorably to other methods across datasets. We have also applied KBoost to a large cohort of close to 2000 breast cancer patients and 24,000 genes in less than 2 h on standard hardware. Our results show that molecularly defined breast cancer subtypes also feature differences in their GRNs. An implementation of KBoost in the form of an R package is available at: https://github.com/Luisiglm/KBoost and as a Bioconductor software package.

Highlights

Reconstructing gene regulatory networks is crucial to understand biological processes and holds potential for developing personalized treatment
The results show that KBoost performs well in both datasets (Table 1)
The results show that KBoost compares favorably to most algorithms and has a similar overall performance as ENNET, a tree gradient boosting algorithm (Table 3)

Summary

Introduction

Reconstructing gene regulatory networks is crucial to understand biological processes and holds potential for developing personalized treatment. Several groups have used different algorithms based on different mathematical formulations to infer GRNs from gene expression data These include Bayesian networks, correlation metrics, mutual information methods and parametric and non-parametric regression. A seminal paper published in 2012 showed that correlation, mutual information and Bayesian networks tended to perform far worse than methods based on r egression[3] For this reason, in this work we focused only on regression based GRN inference methods. Regression based GRN inference methods build a mathematical model of the expression levels of a target gene given the expression levels of different TFs. The central assumption in these methods is that if the expression level of a TF predicts the expression level of a target gene it is likely regulating it.

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

KBoost: a new method to infer gene regulatory networks from gene expression data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Reconstruction of dynamic regulatory networks reveals signaling-induced topology changes associated with germ layer specification.
Emily Y Su ... Qin Bian
Stem Cell Reports | VOL. 17
Emily Y Su, et. al.Emily Y Su ... Qin Bian
27 Jan 2022
Stem Cell Reports | VOL. 17

JRmGRN: joint reconstruction of multiple gene regulatory networks with common hub genes using data from multiple tissues or conditions.
Wenping Deng ... Patrick X Zhao
Bioinformatics | VOL. 34
Wenping Deng, et. al.Wenping Deng ... Patrick X Zhao
30 Apr 2018
Bioinformatics | VOL. 34

Reconstruction of gene regulatory networks with multi-objective particle swarm optimisers
Sandro Hurtado ... José F Aldana-Montes
Applied Intelligence | VOL. 51
Sandro Hurtado, et. al.Sandro Hurtado ... José F Aldana-Montes
23 Oct 2020
Applied Intelligence | VOL. 51

Exploiting transfer learning for the reconstruction of the human gene regulatory network.
Paolo Mignone ... Michelangelo Ceci
Bioinformatics | VOL. 36
Paolo Mignone, et. al.Paolo Mignone ... Michelangelo Ceci
14 Oct 2019
Bioinformatics | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

KBoost: a new method to infer gene regulatory networks from gene expression data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports