Robust gene coexpression networks using signed distance correlation.

Javier Pardo-Diaz, Lyuba V Bozhilova, Mariano Beguerisse-Díaz, Philip S Poole, Charlotte M Deane, Gesine Reinert

doi:10.1093/bioinformatics/btab041

Open Access

https://doi.org/10.1093/bioinformatics/btab041

Journal: Bioinformatics
Publication Date: Feb 1, 2021
Citations: 13
License type: CC BY 4.0

Affiliation: University of Oxford

Abstract

Motivation: Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes/proteins, using a network of gene coexpression data that includes functional annotations. However, the lack of trustworthy functional annotations can impede the validation of such networks. Hence, there is a need for a principled method to construct gene coexpression networks that capture biological information and are structurally stable even in the absence of functional information.

Results: We introduce the concept of signed distance correlation as a measure of dependency between two variables, and apply it to generate gene coexpression networks. Distance correlation offers a more intuitive approach to network construction than commonly used methods, such as Pearson correlation and mutual information. We propose a framework to generate self-consistent networks using signed distance correlation purely from gene expression data, with no additional information. We analyse data from three different organisms to illustrate how networks generated with our method are more stable and capture more biological information compared to networks obtained from Pearson correlation or mutual information.

Availability and implementation: Code is available online (https://github.com/javier-pardodiaz/sdcorGCN).

Supplementary information: Supplementary data are available at Bioinformatics online.
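
To make the idea of a signed distance correlation concrete, the following is a minimal sketch, not the authors' implementation (that is in the repository linked in the abstract). It assumes the standard sample distance correlation for two one-dimensional variables, and it assumes the sign is taken from the Pearson correlation, which the abstract does not spell out. The function names are hypothetical.

```python
import numpy as np


def _centered_distances(v):
    """Pairwise absolute distances of a 1-D sample, double-centred."""
    v = np.asarray(v, dtype=float)
    d = np.abs(v[:, None] - v[None, :])
    return (d
            - d.mean(axis=0, keepdims=True)
            - d.mean(axis=1, keepdims=True)
            + d.mean())


def distance_correlation(x, y):
    """Sample distance correlation between two 1-D samples, in [0, 1]."""
    a = _centered_distances(x)
    b = _centered_distances(y)
    dcov2 = (a * b).mean()      # squared distance covariance
    dvar2_x = (a * a).mean()    # squared distance variance of x
    dvar2_y = (b * b).mean()    # squared distance variance of y
    denom = np.sqrt(dvar2_x * dvar2_y)
    if denom == 0:              # at least one sample is constant
        return 0.0
    return float(np.sqrt(max(dcov2, 0.0) / denom))


def signed_distance_correlation(x, y):
    """Distance correlation with a sign attached.

    Assumption: the sign is that of the Pearson correlation.
    """
    d = distance_correlation(x, y)
    if d == 0.0:
        return 0.0
    r = np.corrcoef(x, y)[0, 1]
    return float(np.sign(r) * d)


if __name__ == "__main__":
    x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
    print(signed_distance_correlation(x, 2 * x + 1))  # perfectly increasing
    print(signed_distance_correlation(x, -x))         # perfectly decreasing
```

Under these assumptions, two expression profiles that rise and fall together score close to 1, and profiles that move in opposite directions score close to -1. Unlike Pearson correlation, the magnitude can also be non-zero for non-linear dependence.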

Highlights

Gene expression data, while noisy, contains key information about biological processes (Kothapalli et al, 2002)
Using STRING, we show that networks from signed distance correlation capture more biological information and are structurally more stable than networks based on Pearson or Spearman correlation or mutual information

Summary

Introduction

Gene expression data, while noisy, contains key information about biological processes (Kothapalli et al, 2002). One motivation behind creating gene coexpression networks from such data is that genes which are coexpressed across multiple samples are likely to have related functions (Hughes et al, 2000; Makrodimitris et al, 2020; Stuart et al, 2003; van Noort et al, 2003), allowing inference of gene function using guilt by association approaches (Wolfe et al, 2005). This procedure is especially useful if the studied organism is poorly annotated. However, the lack of reliable genomic functional information may hinder the construction of gene coexpression networks and the validation of their accuracy.
