Abstract

Membrane organization describes the orientation of a protein with respect to the membrane and can be determined by the presence, or absence, and organization within the protein sequence of two features: endoplasmic reticulum signal peptides and alpha-helical transmembrane domains. These features allow protein sequences to be classified into one of five membrane organization categories: soluble intracellular proteins, soluble secreted proteins, type I membrane proteins, type II membrane proteins, and multi-spanning membrane proteins. Generation of protein isoforms with variable membrane organizations can change a protein's subcellular localization or association with the membrane. Application of MemO, a membrane organization annotation pipeline, to the FANTOM3 Isoform Protein Sequence mouse protein set revealed that within the 8,032 transcriptional units (TUs) with multiple protein isoforms, 573 had variation in their use of signal peptides, 1,527 had variation in their use of transmembrane domains, and 615 generated protein isoforms from distinct membrane organization classes. The mechanisms underlying these transcript variations were analyzed. While TUs were identified encoding all pairwise combinations of membrane organization categories, the most common was conversion of membrane proteins to soluble proteins. Observed within our high-confidence set were 156 TUs predicted to generate both extracellular soluble and membrane proteins, and 217 TUs generating both intracellular soluble and membrane proteins. The differential use of endoplasmic reticulum signal peptides and transmembrane domains is a common occurrence within the variable protein output of TUs. The generation of protein isoforms that are targeted to multiple subcellular locations represents a major functional consequence of transcript variation within the mouse transcriptome.

Highlights

  • The murine transcriptome was redefined based on the sequences generated from the RIKEN FANTOM3 fulllength mRNAs combined with the full-length mRNA sequences available in GenBank [1]

  • This level of transcript variation is consistent with previous studies that estimate that 30%–60% of mammalian genes are alternatively spliced [1,3], there is evidence indicating that the true level of alternative splicing may be greater than 60% [4]

  • The inclusion of multiple protein isoforms within the Isoform Protein Sequence (IPS) dataset did not alter the proportional distribution of these protein features when compared with the results observed in previously analyzed representative proteins sets from mouse and other species [10]

Read more

Summary

Introduction

The murine transcriptome was redefined based on the sequences generated from the RIKEN FANTOM3 fulllength mRNAs combined with the full-length mRNA sequences available in GenBank [1] These transcripts were grouped into 43,539 transcriptional units (TUs), where a TU is a group of transcripts arising from a single genomic locus [1,2]. Of these TUs, 18,802 (38.6%) contained at least two variable spliced transcripts generated via alternative splicing and/or the use of alternative transcriptional initiation or termination sites. We systematically search the mouse proteome for variation in protein features that define membrane organization

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call