Abstract

Across species and tissues and especially in the mammalian brain, production of gene isoforms is widespread. While gene expression coordination has been previously described as a scale-free coexpression network, the properties of transcriptome-wide isoform production coordination have been less studied. Here we evaluate the system-level properties of cosplicing in mouse, macaque, and human brain gene expression data using a novel network inference procedure. Genes are represented as vectors/lists of exon counts and distance measures sensitive to exon inclusion rates quantifies differences across samples. For all gene pairs, distance matrices are correlated across samples, resulting in cosplicing or cotranscriptional network matrices. We show that networks including cosplicing information are scale-free and distinct from coexpression. In the networks capturing cosplicing we find a set of novel hubs with unique characteristics distinguishing them from coexpression hubs: heavy representation in neurobiological functional pathways, strong overlap with markers of neurons and neuroglia, long coding lengths, and high number of both exons and annotated transcripts. Further, the cosplicing hubs are enriched in genes associated with autism spectrum disorders. Cosplicing hub homologs across eukaryotes show dramatically increasing intronic lengths but stable coding region lengths. Shared transcription factor binding sites increase coexpression but not cosplicing; the reverse is true for splicing-factor binding sites. Genes with protein-protein interactions have strong coexpression and cosplicing. Additional factors affecting the networks include shared microRNA binding sites, spatial colocalization within the striatum, and sharing a chromosomal folding domain. Cosplicing network patterns remain relatively stable across species.

Highlights

  • There are several different strategies that can be used to analyze gene expression data (Allen et al, 2012; Jay et al, 2012)

  • Coexpression, Cosplicing, and CoSplicEx Network Construction The test dataset was new RNA-Seq data obtained from the striatum of heterogeneous stock-collaborative cross (HS-CC) mice; N = 60 and approximately 30 million reads were obtained per sample, unique alignment to the reference genome was > 85% and the upper quartile was used for normalization as in Bottomly et al (2011)

  • The advantages and disadvantages of RNA-Seq compared to microarrays to analyze the brain transcriptome have been discussed elsewhere (e.g., Hitzemann et al, 2013); one of the important advantages is that RNA-Seq provides detailed information on alternative exon usage which is unusually high in the brain, especially during brain development (Johnson et al, 2009)

Read more

Summary

Introduction

There are several different strategies that can be used to analyze gene expression data (Allen et al, 2012; Jay et al, 2012). The emphasis on gene connectivity frequently focuses on detecting highly connected “hub” genes that are potential targets for therapeutic manipulation Methods such as the WGCNA have been widely used for the analysis of microarray data, they have been only recently applied to RNA-Seq data, which has improved clustering metrics and provides exon-level resolution (Iancu et al, 2012b; Giorgi et al, 2013). Recent studies (Chen and Zheng, 2009; Dai et al, 2012) have shown that these exon data can be used to generate cosplicing networks that are distinct in structure and function from the coexpression networks These studies have demonstrated that individual exons from different genes can have correlated expression levels even when no correlation is detectable between the overall gene expression levels

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.