Abstract
BackgroundAlternative polyadenylation (APA) results in messenger RNA molecules with different 3′ untranslated regions (3’ UTRs), affecting the molecules’ stability, localization, and translation. APA is pervasive and implicated in cancer. Earlier reports on APA focused on 3’ UTR length modifications and commonly characterized APA events as 3’ UTR shortening or lengthening. However, such characterization oversimplifies the processing of 3′ ends of transcripts and fails to adequately describe the various scenarios we observe.ResultsWe built a cloud-based targeted de novo transcript assembly and analysis pipeline that incorporates our previously developed cleavage site prediction tool, KLEAT. We applied this pipeline to elucidate the APA profiles of 114 genes in 9939 tumor and 729 tissue normal samples from The Cancer Genome Atlas (TCGA). The full set of 10,668 RNA-Seq samples from 33 cancer types has not been utilized by previous APA studies. By comparing the frequencies of predicted cleavage sites between normal and tumor sample groups, we identified 77 events (i.e. gene-cancer type pairs) of tumor-specific APA regulation in 13 cancer types; for 15 genes, such regulation is recurrent across multiple cancers. Our results also support a previous report showing the 3’ UTR shortening of FGF2 in multiple cancers. However, over half of the events we identified display complex changes to 3’ UTR length that resist simple classification like shortening or lengthening.ConclusionsRecurrent tumor-specific regulation of APA is widespread in cancer. However, the regulation pattern that we observed in TCGA RNA-seq data cannot be described as straightforward 3’ UTR shortening or lengthening. Continued investigation into this complex, nuanced regulatory landscape will provide further insight into its role in tumor formation and development.
Highlights
Alternative polyadenylation (APA) results in messenger RNA molecules with different 3′ untranslated regions (3’ UTRs), affecting the molecules’ stability, localization, and translation
By comparing the cleavage site (CS) frequencies between normal and tumor sample groups, we identified widespread tumor-specific APA regulations that are recurrent across multiple cancer types
Cleavage site prediction on the cloud To select a cleavage site (CS) prediction tool, we benchmarked DaPars [13], KLEAT [21] and ContextMap 2 [20] with a universal human reference RNA-Seq library (https://basespace.illumina.com/datacentral, dataset name: “HiSeq 2500: TruSeq Stranded mRNA LT (SEQC: UHR & Brain)”, sample ID: mRNA-UHRR-C1), and compared their predictions to the CSs reported by PolyA-Seq, a data type derived from the universal human reference and which served as the ground truth for tool evaluation [19] (Additional file 1: Figure S1)
Summary
Alternative polyadenylation (APA) results in messenger RNA molecules with different 3′ untranslated regions (3’ UTRs), affecting the molecules’ stability, localization, and translation. Earlier reports on APA focused on 3’ UTR length modifications and commonly characterized APA events as 3’ UTR shortening or lengthening. APA is a form of differential isoform expression that pertains to the 3′ end This paradigm is limited in scope as it considers only pairs of cleavage sites (CSs), commonly referred to as the proximal and distal CSs. this paradigm is limited in scope as it considers only pairs of cleavage sites (CSs), commonly referred to as the proximal and distal CSs Such characterization used to be sufficient when the number of annotated CSs was small (29,283 in human [18]), and most genes with APA had only two CSs [15]. When only the medium-length 3’ UTR is upor down-regulated, neither shortening nor lengthening is appropriate
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.