Integrating many co-splicing networks to reconstruct splicing regulatory modules

Chao Dai,Wenyuan Li,Juan Liu,Xianghong Jasmine Zhou

doi:10.1186/1752-0509-6-s1-s17

Chao Dai, Wenyuan Li + Show 2 more

Open Access

https://doi.org/10.1186/1752-0509-6-s1-s17

Copy DOI

Abstract

BackgroundAlternative splicing is a ubiquitous gene regulatory mechanism that dramatically increases the complexity of the proteome. However, the mechanism for regulating alternative splicing is poorly understood, and study of coordinated splicing regulation has been limited to individual cases. To study genome-wide splicing regulation, we integrate many human RNA-seq datasets to identify splicing module, which we define as a set of cassette exons co-regulated by the same splicing factors.ResultsWe have designed a tensor-based approach to identify co-splicing clusters that appear frequently across multiple conditions, thus very likely to represent splicing modules - a unit in the splicing regulatory network. In particular, we model each RNA-seq dataset as a co-splicing network, where the nodes represent exons and the edges are weighted by the correlations between exon inclusion rate profiles. We apply our tensor-based method to the 38 co-splicing networks derived from human RNA-seq datasets and indentify an atlas of frequent co-splicing clusters. We demonstrate that these identified clusters represent potential splicing modules by validating against four biological knowledge databases. The likelihood that a frequent co-splicing cluster is biologically meaningful increases with its recurrence across multiple datasets, highlighting the importance of the integrative approach.ConclusionsCo-splicing clusters reveal novel functional groups which cannot be identified by co-expression clusters, particularly they can grant new insights into functions associated with post-transcriptional regulation, and the same exons can dynamically participate in different pathways depending on different conditions and different other exons that are co-spliced. We propose that by identifying splicing module, a unit in the splicing regulatory network can serve as an important step to decipher the splicing code.

Highlights

Alternative splicing is a ubiquitous gene regulatory mechanism that dramatically increases the complexity of the proteome
We show that co-splicing clusters can reveal novel functional groups that cannot be identified by co-expression clusters
The exons in a frequent co-splicing cluster can belong to different genes, but are very likely to be coregulated by the same splicing factors, forming a splicing module

Summary

Introduction

Alternative splicing is a ubiquitous gene regulatory mechanism that dramatically increases the complexity of the proteome. The mechanism for regulating alternative splicing is poorly understood, and study of coordinated splicing regulation has been limited to individual cases. To study genome-wide splicing regulation, we integrate many human RNA-seq datasets to identify splicing module, which we define as a set of cassette exons co-regulated by the same splicing factors. A central concept in transcription regulation is the transcription module, defined as a set of genes that are co-regulated by the same transcription factor(s). Such coordinated regulation occurs at the splicing level [4,5,6]. The exons in a splicing module can belong to different genes, but they exhibit correlated splicing patterns (in terms of being included or excluded in their respective transcripts) across different conditions, form an exon co-splicing cluster

Methods

Results

Conclusion