Abstract

The rapid increase in transcriptome data provides an opportunity to access the complex regulatory mechanisms in cellular systems through gene association network (GAN). Nonetheless, GANs derived from single datasets generally allow us to envisage only one side of the regulatory network, even under the particular condition of study. The circumstance is well demonstrated by inconsistent GANs of individual datasets proposed for similar experimental conditions, which always leads to ambiguous interpretation. Here, pan- and core-gene association networks (pan- and core-GANs), analogous to the pan- and core-genome concepts, are proposed to increase the power of inference through the integration of multiple, diverse datasets. The core-GAN represents the consensus associations of genes that were inferred from all individual networks. On the other hand, the pan-GAN represents the extensive gene-gene associations that occurred in each individual network. The pan- and core-GANs prospects were demonstrated based on three time series microarray datasets in leaves of Arabidopsis thaliana grown under diurnal conditions. We showed the overall performance of pan- and core-GANs was more robust to the number of data points in gene expression data compared to the GANs inferred from individual datasets. In addition, the incorporation of multiple data broadened our understanding of the biological regulatory system. While the pan-GAN enabled us to observe the landscape of gene association system, core-GAN highlighted the basic gene-associations in essence of the regulation regulating starch metabolism in leaves of Arabidopsis.

Highlights

  • The accuracy and precision of inferring gene association networks (GANs) and data interpretation are dependent on the amount and quality of the underlying data, analytical methods employed and the experimental design

  • We showed that under similar conditions, the GANs proposed to describe gene regulatory processes differed by the datasets employed in the co-expression analysis with respect to the network constituents, network performance and the biological insights conveyed by the networks

  • We compared the three GANs developed based on the time series microarray data on gene expression in leaves of Arabidopsis grown under diurnal conditions (S1 Fig), hereafter referred to as Smith-GAN, Blasing-GAN and Li-GAN

Read more

Summary

Introduction

The accuracy and precision of inferring gene association networks (GANs) and data interpretation are dependent on the amount and quality of the underlying data, analytical methods employed and the experimental design.

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.