Abstract

Transcript function is determined by sequence elements arranged on an individual RNA molecule. Variation in transcripts can affect mRNA stability, localization, and translation2, or give rise to truncated proteins with differing subcellular localizations3 and functions4. Given the existence of overlapping, variable transcript isoforms, determining the functional impact of the transcriptome requires identification of full-length transcripts, rather than just the genomic regions that are transcribed5,6. Here, by jointly determining both transcript ends for millions of RNA molecules (TIF-Seq), we reveal an extensive layer of isoform diversity previously hidden among overlapping RNA molecules. Variation in transcript boundaries appears to be the rule rather than the exception, even within a single population of yeast cells. Over 26 major transcript isoforms per protein-coding gene were expressed in yeast. Hundreds of short coding RNAs and truncated versions of proteins are concomitantly encoded by alternative transcript isoforms, increasing protein diversity. In addition, ~70% of genes express alternative isoforms that vary in posttranscriptional regulatory elements, and tandem genes frequently produce overlapping or even bicistronic transcripts. This extensive transcript diversity is generated by a relatively simple eukaryotic genome with limited splicing, and within a genetically homogeneous population of cells. Our findings have implications for genome compaction, evolution, and phenotypic diversity between single cells. They also suggest that isoform diversity as well as RNA abundance should be considered when assessing the functional repertoire of genomes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call